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Preface 


The aim of the present book is to discuss theoretical foundations of cosmology. And 
nothing more, but possibly also nothing less. Therefore, it is not a textbook in any 
sense of this word. One of its principal goals is to study the assumptions upon which 
relativistic cosmology is founded, and, first of all, those assumptions that. are implicit 
in the mathematical structure of cosmology. 

The leading idea of the book may be formulated in the following way: The Uni- 
verse is a structure. This structure can be investigated with the help of either local 
or non-local (global) mathematical methods. In the first case, we have theoretical 
physics, in the traditional meaning of this word (i.e. in the sense of “local physics”). 
In the second case, we have cosmology. In this sense, cosmology is a non-local physics. 
In the spirit of this approach, I use, from the very beginning, global mathematical 
methods, even in these parts of the material which are usually regarded as only a 
preparation to cosmology. 

It is worth noticing that cosmology, understood as a non-local physics, is some- 
thing more than a theory of cosmological models. Cosmology in such a narrow sense 
(as a theory of world models) appears in the book not earlier than in the last two chap- 
ters; I think, however, that, when analysing the global structure of space-time in all 
foregoing chapters, I am studying truly cosmological issues. Both special and general 
theories of relativity are presented with stress on their respective global aspects. 

I assume that the reader is familiar with mathematical methods which are em- 
ployed throughout the book; although, for his or her convenience, I usually give main 
definitions with short comments and some examples. In this way, I hope, the reading 
of the book could be smooth enough without having too often to consult other text- 
books or monographs. However, I am aware that more responsible acquaintance with 
the book would sometimes require the aid of some other publications. Bibliographical 
notes, at the end of every chapter (with the exception of chapters 5 and 6), could 
assist the reader in this respect. 

I would like to stress once more that my intention is not to teach the reader how 
to do calculations (I assume that he already knows it), but to clarify concepts and 
show the efficiency of structures. 

Definitions, theorems, examples, and so on, are numbered in a continuous manner 
within each section; the method of numbering is natural and needs no explanation. 
Only the definitions of those concepts which play a role in the exposition have been 
numbered, definitions of all other concepts having been incorporated into the text. 

The organisation of the material is the following. In the first two chapters founda- 
tions and assumptions of the contemporary theory of space-time are discussed. The 
second chapter contains elements of special relativity. I am especially interested in 
the existence of the Lorentz structure. Since it is, in principle, a global problem, it 
conveys — in my opinion — information of cosmological significance. The third chap- 
ter is a concise repetition of the fibre bundle method, one of the main theoretical tools 
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used in the book. With the help of it the general theory of relativity is presented in 
the fourth chapter. The method, on the one hand, makes manifest similarities of this 
theory with gauge theories of contemporary physics and, on the other hand, stresses 
the peculiarity of Einstein’s theory of gravity as a theory of the frame bundle over 
space-time. The fifth chapter occupies the central place in the book. It poses, to 
use Einstein’s expression, “the cosmological problem”, and explicitly formulates the 
leading idea of the entire book. Although no single formula appears in this chapter, 
it is by no means a popular exposition: its full understanding presupposes a consid- 
erable mathematical and cosmological culture by the reader. Analyses carried out in 
this chapter (and a program it outlines) are applied to the simplest world models, 
the so-called Robertson—Walker cosmology, in the sixth chapter. It goes without 
saying that the Robertson—Walker cosmology is reconstructed with the help of global 
methods (the concept of warped product plays a key role here). 

The rather extensive Appendix (of the size of an average chapter) is aimed at 
revealing a beautiful logic inherent in the evolution of space-time theories, starting 
from the Aristotelian physics and ending with general relativity. The application of 
the fibre bundle method, helped — if necessary —- by some concepts of the mathemat- 
ical category theory, has allowed us to reconstruct former theories of space-time in 
the modern geometric framework, and to show a “natural” character of the Einstein 
theory of relativity within the context of this evolutionary chain. It turns out that 
general relativity is logically the almost unavoidable “next step” along the path of 
development. In my opinion, it is a strong, albeit purely esthetic, argument on behalf 
of Einstein’s theory. 

At the end of most of the chapters there is a section entitled Comments and 
Remarks. Here I consider some problems more loosely connected with the content 
of the given chapter, and sometimes I allow myself digressions of a philosophical 
character. 

I am fully aware that the book leaves many problems either completely untouched 
or non-satisfactorily treated. Such problems as: Bianchi cosmology, physics of the 
early Universe, further chapters of the global theory of space-time (including the 
classical singularity problem), many questions of observational cosmology, although 
well (and sometimes very well) developed as far as their technical aspects are con- 
cerned, still await a more careful methodological elaboration. Each of these problems 
(and many others as well) deserve a separate monograph. The Bibliographical Essay, 
after the sixth chapter, containing a list of books on cosmological topics with short 
remarks and explanations, is intended to recompense the reader for the fact that the 
book ends at a point where the most interesting action only begins. 


M. d. 
March 1985 
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Manifold Model of Space-Time 


0. Introduction 


Each physical theory contains cosmological elements, usually as tacit assumptions, 
since each physical theory develops on an “arena”, and the theory of such an arena 
can be thought of as a part of cosmology. Macroscopic physical theories occur in 
space-time and all of them presuppose a certain structure of it, called differential 
manifold structure or simply manifold structure. This structure is common to all 
macroscopic physical theories which differ from one another only when structures are 
taken into account that are superimposed on the manifold structure. The problem 
of a suitable arena for theories of microphysics is still an open question. Quite a 
common opinion asserts that, in quantum mechanics also, manifold continues to play 
its role of an arena for physical processes. Even if the opponents of this opinion are 
right, one has to admit that space-time modelled by differential manifold preserves its 
importance in all quantum theories: all measurements that constitute the empirical 
base of such theories are performed within the macroscopic space-time, and they 
must appear, in this or another way, in the mathematical formalism of these theories 
(here all discussions have their origin concerning the theoretical status of space-time 
in theories of microphysics). 

To construct the manifold model of space time is the goal of the present chapter. 
Its larger parts are of a purely mathematical character. Sec. 1 prepares, so to speak, 
the mathematical environment for the theory of differential manifolds. It presents, in a 
more or less informal manner, some information on functional spaces, and in particular 
on Banach spaces, and their role in modern differential geometry as well as some 
preliminary remarks concerning the geometric model of space-time. Sec. 2 contains 
the definition of differential manifold (in the case of a finite number of dimensions) 
and introduces the most important concepts of the manifold theory. Sec. 3 deals 
with one of such concepts, namely the concept of manifold orientability. Vector and 
tensor fields are introduced in sec. 4, and the notions of immersion, embedding and 
submanifolds appear in sec. 5. Mathematical tools, prepared in this way, enable us 
to present in sec. 6 a succinct construction of the manifold model of space-time. 
Sec. 7 contains comments and remarks more loosely connected with the main topic of 
the chapter. Among others, the attention of the reader is drawn to some difhiculties 
inherent in the very concept of space-time manifold, which made impossible the full 
implementation into a physical theory the so called Mach’s principle, once postulated 


2 1. Banach Spaces and Manifolds — Mathematical Environment ... 


by Einstein and broadly discussed till now. At the end of sec. 7 some remarks can be 
found concerning the status of space-time manifold in theories of the microworld. 

Mathematical material presented in this chapter, as well as in a few next chap- 
ters, is of a repetitive character, therefore definitions are stressed rather than theorems 
and their proofs, mutual dependencies between concepts that will be needed later on 
rather than particular examples. In the “Bibliographical Note” at the end of the 
chapter, the reader can find information concerning books on the manifold theory. 
He is encouraged to choose something from that list. Clear mathematical perspective 
will make further reading of this book easier. 

The model of space-time, as a four -dimensional differential manifold, presented 
in this chapter is very general. It will be gradually enriched in the following chapters. 


1. Banach Spaces and Manifolds — Mathematical Environment 
of the Space-Time Theory 


The general concept of function appeared in mathematics not earlier than the begin- 
ning of the XIX century. It was Riemann who, in his famous inaugural lecture On 
the Hypotheses That Lie at the Foundations of Geometry, first noted the necessity to 
investigate functional spaces. Further development of the function theory consisted 
mainly in a transition to the study of functions whose domains and ranges are not 
limited to the axis of real numbers or to spaces with finite numbers of dimensions. 
Owing to this transition, functional spaces were put into their natural environment, 
namely the general theory of topological spaces. 

Dwelling within the region of elementary mathematics creates a dangerous illu- 
sion. One usually deals with “well-behaved” functions, for instance, with continuous 
or differentiable (as many times as needed) functions, with functional spaces hav- 
ing sufficiently smooth structures, etc. This creates an impression that well-behaved 
functions exist in general, and “pathological cases”, such as noncontinuous or nondif- 
ferentiable functions, are rare exceptions. An instant of reflection (for which seldom 
a time is left in a spate of computations) is enough to see that things are just the 
opposite. There are well-behaved functions that constitute exceptional regions (of 
“measure zero”) within the realm of all possible functions. 

Non-maliciousness of God, of which Einstein spoke so often, reveals itself in that 
in his architectural plans the Creator has used well-behaved functions or at least such 
functions that can be approximated by well-behaved functions. The functions being 
elements of the Banach space belong to this privileged class. 

A linear normed and complete space is said to be a Banach space. This defi- 
nition can be thought of as a natural generalization of the concept of vector field in 
the Euclidean space. To each element r of a Banach space (to each “vector” ) one 
ascribes a real number |[|z||, called the norm of the element (a generalization of a 
vector’s length). The norm is defined with the help of known axioms that formalise 
its properties. Linearity of a Banach space ensures that one can add its elements 
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to each other and multiply them by numbers (real or complex). Completeness of a 
Banach space means that every Cauchy sequence reaches its limit in this space. 

The properties expressed in the above definition of Banach space guarantee that, 
on the one hand, this space is a very general object containing a rich class of special 
cases often met in the mathematical practice and, on the other hand, it is a well 
behaved object eliminating many pathological situations which would require special 
caution from the mathematician. 

In particular, from the definition of Banach space it follows that a metric can 
be introduced in this space. Indeed, the expression ||z,; — x2||, where z; and z2 are 
elements of a Banach space, satisfies the metric axioms. Therefore, Banach spaces 
are metrizable. 

One of the main advantages of Banach spaces is that they constitute a natural 
environment for calculus. The generalization of the derivative in the space R” (which 
is a special case of a Banach space) to the derivative in Banach spaces in general is so 
immediate that many theorems, obtained in this way, look as if they were mechanically 
repeated from the elementary course of differential calculus. In fact, this procedure is 
far from being trivial. In the transition process to general Banach spaces, the notion 
of a derivative of a function is generalized to the concept of a mapping from one 
Banach space to another. In this way, functional analysis acquires one of its most 
fundamental tools. 

Calculus can also be done in spaces more general than Banach spaces, for in- 
stance, in any topological vector space. However, in such a case, the generalization is 
not that natural (for example, the implicit function theorem can no longer be valid 
without additional assumptions). 

The theory of functional spaces is a powerful tool of modern differential geometry. 
Differential geometry has grown up from analytic investigations of curves and surfaces 
in the three-dimensional Euclidean space. The generalization went in two directions: 
first, from three to any number of dimensions (also infinite) and, second, towards 
getting rid of the “surrounding” Euclidean space. In this way, the crucial concept of 
modern differential geometry was born, namely the concept of differential manifold 
modelled on a topological vector space of any (also infinite) number of dimensions 
or, in particular, on a Banach space. In the latter case one speaks about a Banach 
manifold. 

The manifold concept is a generalization of the parametric representation of a 
surface in R®, i.e. of a C* mapping of an open subset of a given subset into R*. In 
the case of an infinite -dimensional Banach manifold, the space R? is replaced by an 
infinite-dimensional ‘Banach space. 

Let X be any set and B a Banach space. The set of pairs (U/,,¢,), called local 
charts, such that 

(1) the sets U; (i € Z) cover X, 
(2) the mappings ¢; are C* bijections from open subsets U, onto open subsets of B, 
(3) the sets ¢(U; U;), for each pair 2,7 € Z, are open in B, and mappings of the 
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type 
$506; ': ¢(U;NU;) —> $;(UiN U;) 


are of C* class, 
is called the C* atlas on X modelled on the space B. Two atlases are called equivalent 
if their union is an atlas. The equivalence of atlases is an equivalence relation. 

A set X together with an equivalence relation of C* atlases on X modelled on 
the Banach space B is said to be a C* differential manifold modelled on the Banach 
space B. The dimension of such a differential manifold is, from definition, the di- 
mension of B; it can be infinite. The atlases that belong to the equivalence class 
appearing in the above manifold definition are called atlases admissible on X. 

Infinite dimensional manifolds modelled on Banach spaces, in short also called 
Banach manifolds, are in fact functional spaces; however, one can introduce on them 
a majority of structures which are natural generalizations of structures known from 
the elementary course of differential geometry. For instance, tangent and cotangent 
spaces, and consequently vector and tensor fields as well as differential forms, can be 
defined on them. In particular, symplectic forms and Hamiltonian vector fields are 
naturally introduced on Banach manifolds. In close analogy to the finite-dimensional 
case one can construct various fibre bundles over Banach manifolds and define fields 
of different geometric objects as cross-sections of the corresponding fibre bundles over 
these manifolds. Tensor fields and differential forms turn out to be special cases of 
such constructions. In this way, one can also introduce a metric (the field of a metric 
tensor) on a Banach manifold, changing it into a Riemann manifold. The geometry 
of such manifolds is further constructed in analogy with the finite-dimensional case. 

The spaces R” (and C”) are Banach spaces with the norm defined in a natural 
way, and differential manifolds modelled on R” (or C”), (their precise definitions will 
be given in the next section), play a key role in the traditional differential geometry; 
they are simply called differential manifolds (real or compler). Differential manifolds 
are locally diffeomorphic to R” (or C”), and, therefore, at each of their points there 
exists tangent vector space with the tangent vectors defined in an “intrinsic manner”, 
i.e. with no reference to any space in which the manifold would be immersed. On the 
one hand, differential manifolds preserve well-known properties of surfaces immersed 
in the Euclidean spaces, and, on the other hand, they are entirely autonomous in the 
sense that, although modelled on R” (or C”), they do not need to be considered with 
reference to any “surrounding space”. 

Owing to these properties, in the contemporary theoretical physics four—dimen- 
sional (real) differential manifolds serve as models of space-time.’ This remains true 
both with respect to classical physics and with respect to special and general rela- 


'The reader should pay attention to a certain fuzziness of commonly accepted terminology: 
one speaks of manifolds modelled on Banach spaces, and one speaks on the physical space-time 
modelled by a four-dimensional manifold. Modelling in the former sense is determined by the 
manifold definition; modelling in the latter sense is one of the most fundamental procedures used in 
theoretical physics, whose methodological aspects are studied by philosophers of science. 
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tivity. It turns out that space-times of pre-Newtonian theories (those of Aristotle 
and Galileo) can also be modelled by four-dimensional differential manifolds (see, 
Appendix). The manifold model of the physical space-time is a subject matter of 
the present book. In this introductory section, I want only to place this space-time 
model in its natural mathematical environment, namely in the general theory of dif- 
ferential manifolds modelled on Banach spaces. Real differential manifolds constitute 
a very special class of Banach manifolds, but nevertheless they are general enough to 
provide description tools of a rich family of phenomena occurring in the real universe. 
It seems that a certain tension between generality and peculiarity of mathemati- 
cal constructions is a key characteristic of the conceptual structure of contemporary 
physics. 

The concept of normed spaces was introduced by Stefan Banach in 1922 in his doctoral thesis. 
From Banach’s work, Theorie des operations lineatres (1932) the “mature age of normed spaces” 
began (Bourbaki 1969). The manifold concept started to crystalise itself in the XIX‘ century, simul- 
taneously in several branches of mathematics, where the necessity to investigate locally Euclidean 
spaces arose. In 1854, in his outspoken inaugural lecture, Riemann made some remarks which turned 
out to be important for the development of the manifold concept (see Riemann 1868). The next 
step belongs to Poincaré: in 1880, when studying certain topological techniques in the theory of 
differential equations, he introduced what is to-day called flows on a manifold. A precise definition 
of a smooth two-dimensional manifold was given by Weyl! in 1913, in his book The Concept of a 
Riemann Surface (see Wey! 1955). Understanding of the manifold concept evolved rather labori- 
ously in the first half of the XX'* century to become, after 1950, one of the most crucial chapters of 
modern topology and differential geometry. 


2. Differential Manifolds 


A topological space X that is homeomorphic to R” in the neighbourhood of each of its 
points is said to be a topological manifold. In the following, we shall assume — unless 
this assumption is directly revoked — that topological manifolds have the Haussdorf 
property, i.e., any two different points of such a topological manifold possess disjoint 
neighbourhoods. The existence of a local homeomorphism between X and R” should 
be understood in the sense of the following definitions. 

A pair (U, ¢), where U is an open subset of X, and ¢: U — V a homeomorphism 
onto an open subset V C R” is said to be a (local) chart on the manifold XY. By 
definition n is the dimension of the topological manifold. Coordinates (z',...,2”) of 
the image ¢(z) € R” of a point x € X are called (local) coordinates of the point zx in 
the chart (U,¢); this chart is also traditionally called a (local) coordinate system on 
X. It is assumed that X is covered by open sets U, being domains of charts on X. 

In the following, we shall additionally assume that the set {U,} covering X is 
countable (X as a topological space has a countable base). However, this assumption 
is not a part of the manifold definition. Because of future applications it 1s conve- 
nient to accept that X is a connected space, or at least that the dimension of X is 
the same in the neighbourhood of each of its points (this property follows from the 
connectedness of X). 


6 2. Differential Manifolds 


Fig. 1.1. Compatibility of maps on topological manifold. 


Two charts (U.,¢a) and (Ug, ¢g) on X are said to be C* consistent if — assuming 
that U, Ug # @ — the mappings 


$8 °¢,': Pa(Ua MU) — $6(UaN Usp), 
gag): bp(Ua Ug) — ba(Ua M Up) 


from open subsets of R” into open subsets of R” are of class C* (see Fig. 1.1). 
The compatibility condition means that there exists a smooth (in the sense of C*) 
transition from coordinates (z'), 7 = 1,...,n, defined on Uy, to coordinates (y') 
defined on Ug, in the region on which U, and Ug overlap, i.e., 


(2°) tr (y') = F(29; 


where f? are real functions of class C* (and analogously the other way round: (y*) + 
(c')). k may be also infinite. If the mappings ¢g 0 $71 (¢. 0 oe) are analytic, we 
write k =w. 

The set of charts {(Ux,¢a)} covering X (ie. X = UU.) and pairwaise C* 
consistent on overlapping domains, is said to a C* atlas A on X. Two C* atlases are 
said to be C* equivalent if their union is a C* atlas on X. 

2.1. Definition. The pair M = (X,{A}), where X is a topological space, 
and {A} a set (an equivalence class, in fact) of equivalent C* atlases on X, is said 
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to be a C* differential manifold (or C* manifold, for short). If there is no danger of 
misunderstanding, we will often omit the indication of the differentiability class (C*) 
in the name of manifolds. If k = w, a manifold is said to be analytical. 

In practice, one works with one atlas treating it as a representation of the entire 
equivalence class. We shall often make use of this simplification.? 


2.2. Example. The space X = R” can be given differential structure by defining 
an atlas consisting of the one chart (U.,¢), where U = R", ¢ = id. 
2.3. Example. To define a differential structure on a one-dimensional sphere 


St = {(2",27) ER (2'? + (2) =), 
an atlas is needed consisting of at least of two charts (U., da), a = 1,2, (see Fig. 1.2) 


U,=S*\{(-1,0}, ¢:U,—-D, CR’, 
Ua=S* \f(n0)}, o2: U,— Dp CR 


The mappings ¢, can be defined in the following way, 


?1': Or => (cos O,sin9), —7<O<7, 
¢,': Or > (cos O, sinO), 0<O<22 


(see, Thirring 1978, pp. 10-11). 


D2 
D 2°07 
! 1 2°91 Ne 5B 
>; rt 
on 1°92 
U; U2 


Fig. 1.2. Atlas on S'. For clarity the set S' is shown in two copies. Little circles, triangles, and 
parentheses indicate which points correspond to each other. 


2 Against a more common usage we prefer the term differential rather than differentiable in the 
name C* manifolds, understanding the difference between these two terms more or less in the same 
manner as one understands the difference between metric and metrizable. In this sense, not every 
topological manifold is differentiable, but if it is C! differentiable, it is also C™ differentiable. 
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If, in the chain of definitions leading to the C* manifold concept, the space R" 
is replaced by its “upper” (“lower”) part, i.e., by the region in R” satisfying the 
condition u! > 0 (u! < 0), where wu! is a coordinate in R” (this region will be denoted 
by }R”), one obtains the so-called C* manifold with boundary. The boundary 0M 
(sometimes denoted also by M) of a manifold M with boundary is defined to be the 
set of those points of M, the images of which under ¢q are situated on the boundary 
of iR” in R”; the mappings ¢. define local coordinate systems on M. 

2.4. Example. Let X = [a,b] Cc R!. Any atlas on X must consist of at least 
two charts (Ua, ¢.), @ = 1,2; for instance, 


U; Sheet), gi: Ter —a, 


U2 = (a, 9], go: r- b—. 
Then OM = {a} U {6}. 


Any C* manifold can be also constructed in a different way, more clearly showing 
the role of the modelling space R”. The construction recipe is the following (see 
Bishop and Goldberg 1980, pp. 31-33). Prepare a suitable number of disjoint copies 
of R", and from them cut out the open regions that will be modelling local coordinate 
systems on the manifold. The open regions which are to be overlapping with each 
other should be cut out of different copies of R”. Now identify those points of the 
open regions which are transformed into each other when the transition is made from 
one local coordinate system to another local coordinate system. To be more precise, 
on the set P, being the union of disjoint open regions of different copies of R”, one 
defines an equivalence relation €, treating as equivalent those points of different open 
regions which correspond to each other under local coordinate transformations. The 
manifold is to be defined as the quotient space M = P/E. A topology on M can be 
introduced by demanding that local coordinate transformations be homeomorphisms. 
Such a topology is called a manifold topology. In short, a manifold can be glued 
together out of a sufficient number of pieces of R”. Whitney (1936) has proved that 
the two ways presented above of defining manifolds are equivalent. 

2.5. Definition. Let M = (X", {A}) and N = (Y?, {B}) be two C* manifolds, 
and (U, ¢) and (W, #) their local charts at points z € U and y € W, respectively. Let 
us consider a mapping f: X" — Y” (Fig. 1.3). The expression po f o ¢7! represents 
the mapping f in the charts (U,¢) and (W,~). The mapping f is C’ (r < k) at a if 
yo fog! isC” at d(x). It can be shown that this definition is independent of the 
choice of a chart. If a mapping f is C’ at each point x € X", f is said to be of class 
C” (or simply to be C’). 

If Y? = R! (Fig. 1.4), f is called a function on X", and the expression f o $7! 
its representative in the chart (U, ¢). 

2.6. Definition. If a mapping f: X” — Y? (of class C*) is a bijection, and 
if both f and its inverse f~' are continuously differentiable (are C*). f is called 
(C*) diffeomorphism. Two manifolds (of class C*) between which there exists a (C*) 
diffeomorphism are called (C*) diffeomorphic. 
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Fig. 1.3. Mapping f:X" > Y?. 


R! 


Rez 


o(x) 


Fig. 1.4. Function f on manifold. 


In the theory of manifolds, diffeomorphic manifolds are indistinguishable from 
each other; in other words, there exists a category, called the category of C* manifolds, 
the objects of which are C* manifolds and morphisms are differentiable mappings of 
class C* between C* manifolds. C* diffeomorphisms are isomorphisms in this category. 
(See Trautman 1970b, 1973b; Sulanke and Wintgen 1972; concise information on the 
category theory is given in the Appendix, sec. 1.) 
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From the theorem proved by Whitney (1936) it follows that the theory of n— 
dimensional manifolds can be identified, in a sense, with the theory of subsets R”™ 
with m < 2n +1. 


3. Orientability of Manifolds 


Let (x') and (y') be local coordinate systems on open subsets of R®. The coordinate 
systems are said to have the same orientation if the determinant of the Jacobian 
matrix J = det(Oxr'/Oy’) is positive at each point of a given open subset of R”. The 
orientation of a chart (U,¢) of a manifold M is defined through the orientation of 
coordinate systems on ¢(U) C R”. A manifold is said to be orientable if there is 
an atlas on it such that any two charts (U;, 4) and (U2, ¢2), the domains of which 
overlap (U; N U2 # @), have the same orientation. A manifold defined in terms of 
such an atlas is said to be oriented. A manifold which is not orientable is said to be 
nonorientable. 

Two atlases of a given manifold are said to possess the same orientation if all 
their charts have pairwaise the same orientations. To be of the same orientation is for 
the atlases an equivalence relation. An equivalence class of this relation is called an 
orientation of this atlas. Every connected manifold admits exactly two orientations. 

The space R”, considered as a manifold with an atlas consisting of one chart 
(R",id) (see example 2.2), is an orientable manifold. The Mobius strip can serve 
as an example of a nonorientable manifold. In the case of two-sided surfaces in the 
three-dimensional Euclidean space, orientability is equivalent to two-sidedness of a 
surface (the Mobius strip is a one-sided surface). (For more information see Brickell 
and Clark 1970, pp. 119-125; Bishop and Goldberg 1980, pp. 162 -164). 

There exists a visual method of deciding whether a given manifold is orientable 
or not. Let us associate a graph G(A) with each atlas A = {(Ua,¢.)} of a manifold 
M = (X,{A}) in the following way. Let the knots of the graph G correspond to 
the charts ¢, of the atlas A, and the edges of the graph, linking the corresponding 
knots, to the connected components of the overlapping regions U, MU, # 0. All 
these correspondencies are assumed to be one-to-one. Two knots of the graph can be 
linked by countably many edges. To all edges of the graph let us ascribe the numbers: 
+1 if the Jacobian matrix in the connected component of the region U, M Ug that 
corresponds to that region is positive, and —1 if this Jacobian matrix is negative. 
In this way, each knot is linked with itself, forming a loop to which a number +1 
is ascribed. The sequence of edges forming a loop will be called a cycle. To the 
sequences of edges (and consequently, to cycles as well) we ascribe a number, being 
the product of numbers which correspond to the edges forming this sequence (this 
cycle). We say that the change of orientation of a knot takes place if all numbers, 
corresponding to edges emanating from this knot, are multiplied by —1. Let us notice 
that this procedure does not change the number corresponding to the loop emanating 
from, and returning to, a given knot (this number is twice multiplied by —1). A graph 
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is said to be orientable if, by suitably changing the orientation of its knots, one can 
obtain the situation in which to all edges of the graph G the numbers +1 are ascribed. 


+1 


+] +1 


-1 


Fig. 1.5. Graph corresponding to the Mobius strip. 


Of course, the change of orientation of a knot belonging to a graph G corresponds 
to the change of orientation of the chart ¢, associated with this knot and belonging 
to the atlas A of a given manifold M. A manifold M is orientable if and only if the 
graph G corresponding to its atlas A is orientable. It can be shown that a graph G is 
oriented, and consequently the manifold corresponding to it is oriented, if G contains 
no cycle to which the number —1 is ascribed. (See Sulanke and Wintgen 1972). 


4. Vector and Tensor Fields on Manifolds 


The fact that manifolds are modelled on the spaces R” implies the existence of the 
tangent space 7,(M) to a given manifold M = (X",{A}) at each point x € X” 
that is isomorphic to R”. (If misunderstanding can be excluded we shall also write 
ze M.) 

Let us consider two differentiable functions f; and fz on a manifold M. These 
functions are said to have the same germ at a point z € M if there is a neighbourhood 
of x on which f; and f, coincide. The germ of a function f at x is ~- from the 
definition — the equivalence class of functions having the germs at x equal to that 
of f. Germs of functions at x form a vector space (with respect to addition and 
multiplication by real numbers). Moreover, since the product of two germs at a point 
x (defined as the pointwise multiplication of functions) is a germ at zr, the germs at. 
x form an algebra. 

Finally, let us recall that a linear mapping satisfying the Leibniz rule is called 
derivation. Now, a vector tangent to a manifold M at a point x € M can be defined 
as a derivation on the algebra of germs of differentiable functions at .r. The set of all 
vectors tangent to M at x is called the tangent space to M at x and is denoted by 
T,(M). Of course, tangent spaces to a manifold are vector spaces (with respect to 
addition and multiplication by real numbers). 

One could translate the above abstract definitions into a language honoured by 
the handbook tradition, and say that the tangent vector v, to a manifold M at a 
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point z € M is a linear mapping from the space F(N) of differentiable functions, 
defined on a neighbourhood WN of z, into the space R’, i.e., 


vr(af + Bg) = av.(f) + Bre(9), 


where a, € R! and f,g € F(N); moreover, the mapping is supposed to satisfy the 
Leibniz rule 
ve(f9) = f(z)ve(g) + g(z)vz(F). 
The set of all such tangent vectors to a manifold M at x € X is the tangent 
space T,(M). 
4 7?) 


T,(X") id Cys) 


7, (Y?) 
TA) v® 2 


Fig. 1.6. Derivative f’ and pull back f* of a mapping f. 


Let us consider the mapping 
f: X" — Y? 


differentiable at c € X". The derivative of this mapping at z is defined to be a linear 
Mapping 

f'(z): Te(X") > T,(Y?), 
transforming v € T,(X") into w € T,(Y?) in such a way that, for every function h 
at the point y = f(z) € Y?, one has w(h) = v(ho f) (Fig. 1.6). It can be easily 
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seen that under the mapping f‘(x) a tangent vector to a curve C in X at a point z 
is transformed into the tangent vector to the curve f(C) in Y? at the point y. 

Let V and W be vector spaces. The set IL(V,W) of all linear mappings from V 
into W is a vector space (with addition and multiplication by scalars, defined in a 
natural way). The vector space IL(V,R!) is called dual to VY and is denoted by V*. 
The space 7,7(M), dual with respect to tangent space 7,(M), is called the cotangent 
space to the manifold M at a point z; its elements are called covectors. The following 
names are also used: contravariant vectors — for elements of T7,(M), and covariant 
vectors — for elements of 7,*(M). 

Let us consider a differentiable mapping 


(PR 


and a covariant vector 0 € 7/(Y?). We define the pull back of 6 (induced by the 
mapping f) as a linear mapping 


fe) — Tee), 


transforming @ intow € TJ,*(M) in such a way that, for any v € 7,(X"), the functional 
equality (f*6)v = 0(f'v)o f is satisfied (or the numerical equality (f*@),v, = 9,(f'v)y 
(see Fig. 1.6)). 

And consequently, the pull back of a function g at a point y = f(z) € Y? (induced 
by the mapping f) is defined as f*(g) = go f (or more precisely: f*(g) + (go f)(z)). 

Let V;,...,V, and W be vector spaces. A mapping f: V) x---x V, — W, linear 
with respect to each variable, is said to be a multilinear mapping. Let 7; € Vi, 2 = 
1,...,. We define the multilinear mapping 


mer iieg: Vi Xue X Vz — R' 


by 
™ @::-®@ Ga(¥1,- a atte) = (aby), 2: ra eel) 


where v; € V;. This function is called the tensor product. 
4.1. Definition. A real multilinear function 


fr Vt xe. KV" xX Vx x V—>R! 
_——— ee 


r times s times 


is called a tensor of the type (r,s) (over the space V). Tensors of the type (r,s) form 
the vector space 
Gen be arate y*= SEL Y 
Yew erere (V) 
r times s times 


(or in the abbreviated form @’V @* V* = T’,(V)). If the dimension of V is n, 
dim Y =n, then dim7’,(V) = n’t*. Tensors of the type (0,0) are scalars, tensors of 
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the type (1,0) are contravariant vectors, and tensors of the type (0,1) are covariant 
vectors. 

A tensor of the type (r,s) on a manifold M is defined to be a real multilinear 
function on 


@" T.(M) 8 T(M) = T's(Te(M)). 


A tensor field T of the type (r,s) on a manifold M is defined to be a mapping 
M — 7T’,(M) such that, for every z € M, one has T(x) € T',(7z(M)). A tensor 
field of the type (1,0) on M is a field of contravariant vectors on M; a tensor field 
of the type (0,1) (of the class C*) is also called a one-form. A tensor field is called 
symmetric (antisymmetric or skew-symmetric) if, at each point r € M, T(z) is a 
symmetric (antisymmetric) tensor. Totally antisymmetric tensor field of the type 
(0, p) (of the class C*) is called a p-form. 

The pull back mapping can be generalized to any p-form. Let f be a differen- 
tiable mapping 

FX ee 


and w any p-form on Y%. The pull back f*w of the p-form w (induced by the map- 
ping f) is defined in the following way, 


PONG) Hal fur. foe 
where v; € 7,(A%)) and!) fae fae): 


5. Submanifolds 


Let f:N? — Mp? be a differentiable mapping from a differential manifold NV? to a 
differential manifold M?. This mapping is said to be of rank r at a point y € N? 
if the dimension of f'(7,(N%)) is r. The following, almost self-evident, relationships 
hold: 

if the rank of f at y is g (= dim), the induced mapping f’ at y is injective, 
and q < p; 

if the rank of f at y is p (= dimM?), the mapping f’ at y is surjective, and 
q 2 Pp. 

If the rank of the mapping f (of the class C*) at each point of the manifold N° is q, 
f is called the immersion (of the class C*), and the manifold N is said to be immersed 
in the manifold M?. Let us note that the set f(A/%) need not carry the manifold 
structure since f need not be injective (however, if f is an immersion, f’ is always 
an injection; this immediately follows from the above statements and definitions). 
Immersion that is also an injection is called embedding, and one correspondingly 
speaks of an embedded manifold into one another. If f is an embedding, the set 
F(N%) C M? with the differential structure induced by f is a manifold, although this 
differential structure need not be consistent with the original differential structure on 
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Mp?; if it is consistent, f is said to be a regular embedding. If fis a regular embedding, 
f(N?) is called? a submanifold of M?. 

An open subset of a manifold M? considered, in a natural way, as a submanifold 
of M?, is called an open submanifold of M?. 


6. Manifold as a Model of Space-Time 


All macroscopic theories of contemporary physics — and theories that have played 
important roles in the history of physics (provided they have been reconstructed in 
the language of modern mathematics, see Appendix) — presuppose that the arena 
in which physical processes occur has the structure of a four-dimensional differential 
manifold. Points of this manifold are called events. Events are determined by their 
four coordinates in local charts. Local charts are treated as local reference frames (see 
below, sec. 7, A). One of the coordinates in each of such charts is considered to be the 
time coordinate, the remaining three — space coordinates. This distinction of time 
and space coordinates has a purely local meaning: it is defined only in a local chart. 
One should expect that the possibility to cover the entire manifold with a single chart 
(i.e. the possibility of an atlas consisting of one chart, as it is the case in R”) is a rare 
exception. Since from the mathematical point of view time and space coordinates 
are on equal footing, one often prefers speaking of four coordinates of events or — 
to be more precise — saying that a four-dimensional differential manifold serves as 
a mathematical model of the physical space-time. With less care, but more easy, one 
says simply that the space-time is a four-dimensional manifold. Therefore, it can be 
asserted that space-time is a set of all possible events, where by an event one should 
mean an “idealized occurrence in the physical world having extension neither in space 
nor in time” (Geroch 1978, p. 3), for instance, an idealization of an elastic collision 
of two particles. 

The four-dimensionality of space-time is an empirical fact. A physical counter- 
part of the dimension is a number of the degrees of freedom of a physical system. For 
instance, three degrees of freedom in a translational motion correspond to the three 
dimensionality of space. The possibility to move along only one temporal direction is 
reflected in the one dimensionality of time. Let us notice, however, that the “arrow” 
along this direction (the so-called arrow of time) has not been determined by the 
model. The fact (known from the very personal experience) that physical processes 
can “move” only from the past to the future has no explanation within the present 
model. At most, we can, in a purely conventional manner, call one end of the tem- 
poral direction the future, and the other and — the past. In the following, we shall 
often make use of this convention. 

3Qne should note the difference in terminology used by different authors (see, for example, 


Choquet-Bruhat et al. 1982, pp. 239-242; Bishop and Goldberg 1980, pp. 40-43; Kobayashi and 
Nomizu 1963, chapt. 1, sec. 1; Brickell and Clark 1970, pp. 68-83; O'Neill 1983, pp. 15-21; Hawking 


and Ellis 1973, pp. 23-24). 
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Usually, it is assumed that a manifold modelling the physical space-time is 
orientable, and, additionally, that it is separately time and space orientable (this 
will be defined in chap. 2, sec. 1; see also below sec. 7, B, and sec. 1. 3 in Sachs and 
Wu 1977). 

The manifold model ascribes to space-time not only the continuity but also (in 
general) a “sufficiently smooth” continuity. Usually, it is assumed that all mappings 
appearing in the definition are of class C*, where k is as large a number as it is required 
by a given physical situation. Most often, one simply assumes* that a modelling 
manifold is of class C™. 

The postulate of continuity leads to the question of the status of this concept 
in microphysics. According to quite a common opinicn, it is a smooth space-time 
manifold that continues to play the role of an arena for physical processes in quantum 
mechanics. However, one could quote convincing reasons supporting the view that a 
smooth space-time is a macroscopic concept: we coin this concept within the world of 
macrophysics, in which all measurements are performed providing us with information 
about the micro—world, and then “projecting it down” into the domain of quantum 
phenomena. Such a view is corroborated by an analysis of the mathematical structure 
of quantum mechanics. Unfortunately, this problem goes beyond the limits of the 
present book: in the following — except for sporadic remarks — we will deal only with 
space-times of macroscopic physical theories which are always modelled by smooth 
manifolds. 

Finally, one should note an interesting methodological problem: can the manifold 
model of space-time be defined operationally (or at least quasi-operationally)? And, 
if the answer is positive, how can it be defined? According to Trautman’s (1964) 
recipe, a space-time manifold can be constructed with the help of “negligibly small” 
clocks, the relative positions of which are parametrised by triples of real numbers. 
In the construction proposed by Synge (1965), a fundamental tool is again a clock, 
and a fundamental operation — a time measurement. However, this author tacitly 
assumed the existence of signals that would inform the observer on distant events. A 
construction, free from such gaps, was proposed by Ehlers, Pirani and Schild (1972). 
In their approach, free fall and light propagation serve as “elementary operations” 
(see also Ehlers 1973). These operations allow one to construct not only the manifold, 
but also the metric structure of space-time (see, next chapter and Heller 1977). 


7. Comments and Remarks 


A. Local charts on a manifold correspond to local frames of reference in space-time 
(although not all coordinate transformations a physicist would consider as changing 
to new frames of reference (see, Dixon 1978, pp. 2-7). Full discusion of this problem 


4Some authors, when speaking of a C* space-time, have in mind the fact that the metric defined 
on this space-time is of class C* (see, for instance, Tipler et al. 1980, pp. 155-157). 
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is possible only after a metric has been introduced on a manifold (see, chapter 2, 
sec. 5, D). The fact that a manifold admits — on the strength of its definition — all 
atlases equivalent to a given atlas (see above, sec. 2) shows a certain “relativity” of the 
space-time model discussed here: the very concept of space-time includes all possible 
classes of reference frames consistent with it, and none of these classes is a priori 
distinguished in any way. Some classes of reference frames may become privileged 
only afterwards, when the space-time considered is enriched by additional postulates 
which should always be physically motivated. 


B. The problem of space-time orientability, space orientability and time ori- 
entability belongs to the most fundamental problems of physics and cosmology. Hav- 
ing in mind that at the present stage of our analysis we have no “generally valid” 
method to split space-time into space and time separately, let us refer to interesting 
remarks of a mathematician, Laurent Schwartz (1967, vol. 2, chapt. 6, sec. 5), who 
after asking the question 

“Is our universe an orientable manifold?” continues: Let us put aside the relativistic point of 
view that teaches us to look at the universe as a four-dimensional manifold, and let us assume that 
our world is modelled by a three-dimensional C™® manifold. Is it orientable? Do the known laws of 
physics allow us to equip it with a canonical orientation? To begin, let us assume that our world, 
considered as a three—-dimensional manifold, is not orientable. This means that in the Universe there 
exist some paths, similar to those in the Mobius strip, such that starting a journey along one of them 
with a certain orientation, and pursuing the orientation continously during the travel, one comes 
back to the staring point with the opposite orientation. A human being travelling throughout the 
Universe along such a path, after coming back to the Earth, would find that its heart is at the right 
side of its body, and its liver at the left side. Letters in a book, read during the travel, would now 
tun from right to left. All this without any sudden change experienced by the traveller during his 
trip. Anyway, after his coming back he would judge himself absolutely normal and “invariant”. On 
the other hand, he would think that all phenomena on the Earth had been “reversed” during his 
absence. Schwartz concludes that all arguments for the world’s orientability, quoted in elementary 
textbooks ‘of physics and mathematics, rest on a very weak basis. 


C. From the times of Einstein, there exists in physics a set of ideas called the 
doctrine or the principle of Mach. These ideas go back to Leibniz and Berkeley; 
however, they were read out by Einstein from the works of Ernst Mach in which 
this author criticised the Newtonian mechanics. Roughly speaking, Mach’s principle 
postulates a causal relationship between the local physics and the global structure of 
space-time (for an account of Mach’s original doctrine, see Heller 1988). In its more 
maximalistic formulations Mach’s principle requires that the global structure of the 
world should uniquely determine the local physics®. (For more details see Goenner 
1970, Reinhardt 1973, Heller 1975a, Raine and Heller 1981, Raine 198la, Barbour 
1983). As the number of quoted works testifies (and this is only an “unfair sample” 
of the continuously increasing literature), discussions on the subject are still going 
on. It seems that the status of Mach’s principle in the theory of general relativity 


5In the original Einstein’s formulation Mach’s principle postulates the “relativity of mass”: the 
mass of a test body should be uniquely determined by the distribution and motions of all masses in 


the Universe. 
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has been clarified in the works of Raine (1975, 1981a), but Mach’s ideology manifests 
some tendencies to go beyond this theory and to provide new inspirations in creating 
new physical theories (see, for example, Barbour 1974, 1983; Barbour and Bertotti 
1977; Raine and Heller, chapter 13). I do not want now to go deeper into this set of 
problems; I would like only to note that the manifold model of space-time imposes 
substantial restrictions upon the possibilities to implement Mach’s doctrine into a 
physical theory. Any manifold has — from its very definition — local topological 
and differential properties identical with those of the modelling space R”. Although 
this circumstance leaves a certain freedom to manipulate the local structure of space— 
time with the help of field quations (see, Raine 1975, 1981a), this does not change the 
fact that space-time must locally be R” irrespectively of its global structure. This 
limitation can be removed only by postponing modeling space-time by a differental 
manifold (see, below, comment E). 


D. The manifold concept has been defined in terms of the modelling space R”. A 
manifold can be also constructed by suitably gluing copies of R” (see, above, sec. 2). 
In this sense, the arithmetical space R” is a priori with respect to the manifold 
concept. However, it is not a logical necessity, but first the result of our decision 
on what a manifold is, and second the question of agreement of this decision with 
empirical data. Under the present precision of empirical data the model can, in 
principle, be discarded and replaced by a new one. Kant was wrong when asserting 
that the Euclidean character of space is an a priori condition of any experiment. 


E. A host of empirical data testifies that, within the domain controlled by macro- 
physics, there is no need to give up the manifold model of space-time, although it 
is not impossible — as we have seen — that, in microphysical theories, space-time 
appears from outside, as it were, namely as a consequence of the fact that an ob- 
server performs all his measurements being immersed in the macroscopic space-time. 
The necessity to go beyond the manifold model of space-time could appear within a 
domain of quantum gravity. There are strong reasons compelling one to believe that 
a quantisation of gravity — first of all, at very early stages of the world’s evolution, 
in the vicinity of the initial singularity — could hardly be avoided. Although some 
other proposals are known, it seems highly probable that the quantisation procedure 
would require giving up (or rather to generalize) the manifold model of space-time. 
As an example of this view, let us quote from Trautman’s work: 


“The topological and differential structures of space-time do not seem to possess a well-defined 
operational meaning. Therefore, it is likely that they will have to be abandoned, or rather replaced 
by another structure which will be more closely related to, and influenced by, physical phenomena 
than the absolute, locally Euclidean manifold structure of space-time assumed in all current theories. 
In my opinion, a satisfactory quantum theory of space, time and gravitation will have to do away 
with the notion of a differentiable manifold as a model of space-time” (Trautman 1973b, p. 183). 


Of course, one should expect that the future, long-expected quantum theory of 
gravity would give, as its limiting case, the present manifold model of space-time. 
For this reason this model ought to be regarded as a permanent achievement of 
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contemporary physics. 


Bibliographical Note. Fundamental information on Banach spaces could be found in the 
two-volume book of Maurin (1976, 1980), a more extensive study of functional analysis in the 
classical monograph by Riesz and Nagy (1972), and a very transparent introduction in the book 
by Roman (1975, vol. 2). Chapter 7A of the book by Choquet-Bruhat et al. (1982) gives a short 
account of differential manifolds modelled on infinitely—-dimensional topological spaces. 

The concept of (finitely-dimensional) differential manifold is presented in all handbooks of 
modern differential geometry. From among many excellent books in this field, the two volumes by 
Kobayashi and Nomizu (1963) merit our special attention. This book is often quoted by physicists, 
and many works of differential geometry written after 1963 are based on it and often contain gen- 
eralizations or applications of its results. More concise, but also very good, accounts of differential 
geometry can be found in Bishop and Crittenden (1964) or Hicks (1965). The books by Auslander 
and MacKenzie (1963), and by Brickell and Clark (1970) are devoted entirely to manifolds; the 
last one contains many useful examples. The reader interested in an introductory exposition should 
consult Wallace (1968) or O’Neill (1966). The first of these books deals with a topological aspect of 
manifolds, the second is an elementary, but modern, exposition of differential geometry with a num- 
ber of computational exercises; the manifold concept is introduced gradually as a generalization of 
the concept of plane in the Euclidean space. The book by Schutz (1984) is especially recommended 
because of its didactic values. The reader is introduced to both modern geometric structures and 
computational techniques. The selection of the material has been made in view of future applications 
in physics; such applications are also discussed. Conceptual foundations of the manifold theory are 
very clearly discussed in a little book by Isham (1989). 

The textbook of differential geometry written by Sikorski (1972) is exceptional in a certain 
sense. The exposition of differential geometry is here based on a concept of differential space, 
introduced by the author. A differential space is defined to be the pair (M,C) where M is any 
set, and C a family of real functions on M, satisfying two conditions that axiomatically define the 
“smoothness” of functions belonging to C, namely: (1) C is closed with respect to localization, (2) 
C is closed with respect to superposition with Euclidean functions. To be more precise, let (M, 7) 
be any topological space, and C any non-empty family of functions on M. A function f, defined 
on NV C M, is said to be a local C -function if, for every p € NV, there is a neighbourhood U in the 
topological space (N,Tn), where ty is a topology induced in N by 7, and a function g € C such 
that fly = glu. The set of local C-functions is denoted by C4; one obviously has C C Cy. The 
family of functions C is said to be closed with respect to localzzation if C = Cy. Let, further, € be 
the set of all C® functions defined on R” called smooth Euclidean functions. C is said to be closed 
with respect to superposition with smooth Euclidean functions if, for any function w € € and any 
nE€N, the fact that f;,...,f, € C implies that wo(fi,..., fn) € C. The differential space concept 
is a vast generalization of the manifold concept. Every subset of R” is a differential space, but there 
exist many differential spaces which can be embedded in no R® (even if n = oo). In particular, 
each differential manifold is a differential space. A diffeomorphism between two differential spaces 
can be defined in a natural way. By postulating that a differential space is locally diffeomorphic to 
a Euclidean space, one recovers the concept of a differential manifold. It turns out that, in spite of 
its generality, differential space is a workable concept. In particular, one can meaningfully define a 
tangent space to a differential space at any of its points. Large parts of differential geometry can be 
developed on differential spaces proper (i.e. on such differential spaces that are not manifolds). It is 
worthwhile to notice that Walczak and Waliszewski (1981) have published a collection of exercises 
(in general, difficult), especially adapted to Sikorski’s book. Penrose and Rindler (1984) define 
space-time manifold as a pair (M,C), without however, generalizing it to space-time differential 
spaces. 

In Gruszezak et al. (1988, 1989) a proposal has been worked out to model space-time by 
a differential space rather than by a differential manifold. Such a generalized space-time model 
can find its applications in a quantum gravity regime (when space time is still a differential space 
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but already not a differential manifold), and seems to be very promising in describing space-time 
singularities. Unfortunately, Sikorski (1972), and Walczak and Waliszewski (1981) exist only in 
Polish. For a foreign reader an extensive review by Heller et al. (1989) could be useful of differential 
space methods and its possible applications to space-time geometry. 

There exist many expositions of differential geometry, written especially for physicists. Those 
by Choquet-Bruhat et al. (1982), Bishop and Goldberg (1980), and O’Neill (1983) are the best for 
our purposes. Useful information concerning the differential structure can be found in the second 
volume of the book by Richtmyer (1981, chapters 23-28). The reader interested in a short review of 
modern methods in differential geometry should consult a paper by Schmidt (1972) or introductory 
chapters of such relativity theory textbooks as: Misner et al. (1973), Schutz (1985), Ryan and 
Shepley (1975); the last one is on a more advanced level than the first two. Thirring (1978) contains 
a transparent presentation of elementary concepts of the manifold theory (with applications to 
classical physics) and a number of instructive exercises. 

The manifold model of space~time has been submitted to a detailed analysis by Hawking and 
Ellis (1973), and Trautman (1964, 1968, 1970a, 1973b, 1976, 1978). See also the book by Raine and 
Heller (1981), and the works of the present author (Heller 1981a, 1986). 


2A 


Chapter 2 


Relativistic Model of Space—Time 


0. Introduction 


The space-time model introduced in the previous chapter is very general. To this 
property it owes its applications in all major theories of macroscopic physics. The 
manifold structure ensures, among others, the existence of local coordinate systems 
on a manifold, and the possibility to do calculus on it. Because of that, concepts such 
as: position, velocity and acceleration are well determined (at least locally) in all 
macroscopic space-times. However, the manifold model is too poor to allow the full 
physics to “develop” on it. In particular, such important concepts as space and time 
measurements cannot be defined within the conceptual framework presupposed by 
this model. Space and time measurements become meaningful only after introducing 
a metric on a manifold (or on certain of its submanifolds, as it takes place, for 
instance, in the case of space-time of classical mechanics; see Appendix). The remark 
in parentheses suggests that various physical theories will differ from each other in 
the way they introduce a metric structure on a space-time manifold. In the present 
chapter, we will analyse the metric structure demanded by the relativistic theories of 
space-time, i.e, by special and general relativity theories, that is to say the Lorentz 
metric structure. 

The question “what are the (necessary and sufficient) conditions of the existence 
of the Lorentz metric on a differential manifold?” is of uppermost importance for 
theoretical physics. If we agree with Maxwell’s saying that in physics one does not 
know of which one speaks unless one has a recipe how to measure, we must admit 
that there is no meaning in speaking of space-time physics as long as there is no 
metric defined on a corresponding space-time manifold. Therefore, the question of 
the Lorentz metric existence is, in a sense, the question of the possibility of physics. 
The metric structure is very rich. It contains several other structures (conformal, 
projective, affine, ...). These structures can be introduced axiomatically starting from 
the bare topological and differential structures. In this way, by gradually enriching the 
construction with new properties, ensured by subsequent axioms, one finally obtains 
the full Lorentz manifold (see Ehlers et al. 1972). However, in this chapter we will 
adopt a different strategy: we introduce a “ready-made” Lorentz metric on a manifold 
which implicitly contains all other afore-mentioned structures; they will be used and, 
if necessary, further analysed in subsequent parts of the present book. 

Mathematical richness of the Lorentz structure is paralleled by its paramount 
importance in physics. As it is expressed by Sachs and Wu (1977b, p. 12): 


“(The Lorentz metric] g somehow remembers the right things and forgets the wrong ones. 
Concepts of causality, distance, time, velocity, speed, acceleration, rotation, rigidity, simultaneity, 
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orthogonality, gravity, and so on, are derived from g to the extent they are retained at all. g therefore 
must play many roles; its unifying power is remarkable.” 


In the present chapter, the organization of the material is the following. Section 1 
introduces a metric on a manifold, changing it into a metric manifold, and sec. 2 gives 
a physical interpretation to this procedure. Section 3 presents, in a more detailed 
manner, a special case of a metric space-time model, namely the Minkowski space— 
time which constitutes a geometric arena for the special theory of relativity. Finaily, 
in sec. 4, necessary and sufficient conditions are given for the existence of a Lorentz 
metric on a differential manifold. Besides the standard treatment of this problem, its 
formulation in the language of cohomology groups is briefly presented. In Comments 
and Remarks (sec. 5) we elucidate a relationship between the existence of the Lorentz 
structure on a manifold and the concept of time. Results obtained in this chapter 
seem to suggest that a certain form of temporality is a necessary condition for physics 
to happen on a manifold. 


1. Metric Manifolds 


1.1. Definition. Let X be a differential manifold. A C* metric (or C* metric field) 

on X is a C* tensor field of the type (0,2) on X, satisfying the following conditions: 

(1) g is symmetric, i.e., for every x € X, the tensor g, (g at a point x) is a symmetric 
tensor; 

(2) for every x € X, the bilinear form g,(. , .) is non-degenerate, i.e., g,(v, w) = 0, 

v,w € 7,(X), for all v, if and only if w = 0. 

One speaks also of the metric tensor g. A differential manifold with a metric, 
defined in this way, is said to be a Riemann manifold (or a Riemannian manifold). 
Sometimes one says that a manifold X is equipped with a Riemann structure (or 
a Riemannian structure). A Riemann manifold is said to be a proper Riemannian 
manifold if, for every 0 # v € T,(X), c € X, one has g,(v,v) > 0. If this is not the 
case, one speaks of a pseudoriemannian manifold. 

A metric tensor g equips the tangent vector spaces 7,(X), z € X, with scalar 
products g,(v,w) = gijv'w’, v,w € T,(X). As it is well-known, in the Euclidean 
space, with the help of the scalar product, one can measure lengths of vectors and 
angles between vectors. These operations are naturally transferred to Riemannian 
manifolds. In particular, the norm (“length”) |lv|| of a vector v € T,(X) is defined 
as |\v||? = g2(v,v) = gijv'v?. In pseudoriemannian manifolds, it can be ||v|| = 0, for 
v # 0; such a vector v is called null (or lightlike). 

With a suitable choice of a basis (e;) for J-(X) the quadratic form q = g-(v, v) 


can be written as 
k n 
qg= gyv'v’ = — Vv!) + er 
=I i=k+1 
The quadratic form q is often denoted by ds? and is also called a line element of 
the manifold M. The basis, in which the quadratic form g,(v,v) assumes the above 
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form, will be called pseudoorthogonal. The number k, often denoted by J also, is 
called the index of q; and the number (n — J) — I its signature.1 Because of the 
metric continuity, the index and signature are the same for all c € X, provided the 
manifold X is connected. If J = 0 (or J = n), the manifold is of course a proper 
Riemannian manifold. 

1.2. Definition. A manifold for which J = 1 (or J = n — 1) is said to be a 
Lorentz manifold (Lorentzian or sometimes hyperbolic). Vector spaces T,(X) tangent 
to Lorentz manifolds are said to be Minkowski manifolds. Correspondingly, one speaks 
of Lorentz or Minkowski metrics. 

All non-zero vectors v € T(X), where X is a Lorentz manifold, are classified 
(independently of the choice of a basis) into timelike, null, and spacelike according to 
whether g,(v,v) is less than, equal to, or larger than zero. The set of all null vectors 
L, € T,(-X) is called a light cone at x. The light cone L, separates all timelike vectors 
in 7,(.X) from all spacelike vectors in 7,(X). The set T, of all timelike vectors in 
T,(X) is an open set in 7,(X) and has two connected components, T* and Ty, which 
are conventionally called the future and the past of x. Analogously, the light cone 
L, has two connected components, L* and Lz, which are called the future light cone 
and the past light cone of x, respectively. 

The set T =U, Tz, t € X, of all timelike vectors in a Lorentz manifold X can 
have one or two connected components (see Sachs and Wu 1977b, 1.2). Let X bea 
connected Lorentz manifold. If the set JT has two components, X is said to be time 
orientable; if T has one component, X is said to be time nonorientable. If one of 
the two components of T is defined to be future oriented, and the remaining one as 
past oriented, the manifold X is called time oriented. One should notice that the 
geometry itself does not distinguish any of the two possible time orientations. At this 
stage of our analysis the choice of one of them is a matter of convention. 

Let X” and Y” be differential manifolds with metrics g,; and gz, correspondingly. 
A mapping f:X" — Y™, which is a diffeomorphism and preserves the metric, i.e., 
91 = f*g2, is said to be an isometric mapping or shortly an isometry; in such a case, 
the manifolds X" and Y” are said to be isometric. A Riemann manifold, isometric 
with the manifold (R”,id) with the metric g = )\_, €,dxr'dr', where €, = 41, is 
called a flat space. If € = +1, for every 2, the manifold is called a Euclidean space 
and is usually denoted by €”. Minkowski spaces have the metric 


1 
n = —dx°dz® + Ss dr‘dz’, 


| 


therefore, they are flat spaces. The bilinear form 7 is called a Minkowski metric. 


1QOne should notice different definitions of index and signature by various authors, and also 
different conventions concerning signs of some geometric formulae. A list of these conventions can 
be found on the inner side of the cover of the textbook by Misner et al. (1973). 
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2. Metric Model of Space-Time 


Differential manifolds equipped with a metric structure model spaces or space—times 
of many macroscopic physical theories. For example, instantaneous spaces of clas- 
sical mechanics are proper Riemann manifolds, more precisely, three-dimensional 
Euclidean spaces. One should note, however, that space-time of classical mechanics 
is not a metric manifold, since there is no meaningful operation in it allowing one to 
measure space-time intervals between pairs of non-simultaneous events (see Raine 
and Heller 1981, chapt. 7, and Appendix at the end of the present book). Physi- 
cal spaces modelled by metric manifolds need not be Euclidean spaces. For instance, 
instantaneous spaces of the Friedman—Lemaitre cosmological models are three-dimen- 
sional, not necessarily flat, Riemann manifolds (see, e.g., Bondi 1965, Weinberg 1972, 
Sciama 1975, Raine 1981b). 

Successes of special and general relativity have convincingly demonstrated the 
efficiency of Lorentz manifolds in modelling the physical space-time. The Lorentz 
structure is much more subtle than the proper Riemann structure, and it turns out 
that many important physical results are but almost trivial conclusions following 
directly from the Lorentz structure.?, With no exaggeration one can say that four- 
dimensional manifolds with a Lorentz structure play the role of a standard model 
of the macroscopic space-time in contemporary physics. In the rest of this book 
we shall exploit this model. Now, we make a convention that — unless something 
to the contrary is directly expressed — by a physical model of space-time we shall 
understand the pair (M,g), where M is a four-dimensional C* manifold with respect 
to which all remarks of chapter 1, sec. 6 are valid, and g is a C’ Lorentz metric, 
with the +2 signature, defined on M.* We shall assume — unless directly expressed 
otherwise — that both the manifold M itself and the Lorentz metric g, on it are 
continuously differentiable as many times as it is necessary; in practice, it means 
that M and g are of class C°. The pair (M,q) will be called a relativistic model of 
space-time. 

Now, some specifics must be introduced. As it can be easily checked, any 
isometry is an equivalence relation and, strictly speaking, it is not a single pair (M, g) 
but an equivalence class of such pairs that is a model of space-time; two pairs (My, 1) 
and (Mo, g2) are equivalent, if they are isometric, i.e. if there is a mapping f:M, > 
Mz, such that f*g2 = g,. A metric model of space-time does not distinguish between 
two manifolds if they are isometric. We should keep this in mind, although one usually 
works only with one pair (M, g) treating it, in a sense, as a representative of the entire 
equivalence class (see Hawking and Ellis, p. 56). Traditional physics is interested in 
local properties of space-time; cosmology, from its very nature, contemplates global 
properties of space-time. And here we are met with a subtle, but rich in consequences, 


?One can see it clearly in the book by Sachs and Wu (1977b) aimed at presenting general relativity 
theory in the form of a mathematical text, as far as it is possible. In consequence of this endeavour 
significant parts of the theory changed into concise corollaries of: mathematical theorems. 

3From the manifold definition it follows that r < k. 
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difficulty: how to guarantee that we are dealing with the “whole of space-time”, that 
no part of if has been “artificially” cut off? First of all, one must make more precise 
intuitions contained in this question. The following definitions serve this end. A 
space-time (M’,g’) is said to be a C’” extension of a space-time (M,g) if there is 
an isometric C’ embedding of the manifold M into the manifold M’, f:M —> M’. 
A space-time is said to be C’ ineztensible if, for every its isometric C’ embedding, 
M — M’, one has f(M) = M’, i.e. space-time is inextensible if it is identical with 
every its extension (see Hawking and Ellis 1973, pp. 58-59). Geroch (1970) and 
Clarke (1976) have shown that every Lorentz manifold can be extended as long as it 
becomes inextensible. 

Inextensibility is a mathematical counterpart of the intuitive concept of “to- 
tality” of space-time. In the following, we shall assume — unless directly stated 
otherwise — that all space-times we shall be dealing with are inextensible in a suit- 
able class of smoothness. It is evident that the specification of this class of smoothness 
is crucial. If we diminish smoothness requirements, an inextensible space-time may 
become extensible. The question how to suitably choose the smoothness condition 
has far-reaching consequences in cosmology (see Tipler et al. 1980, pp. 155-158). 

Usually, one ascribes some other properties to a metric manifold as a space- 
time model (besides its inextensibility), for instance: orientability, orientation (see 
chapt. 1, secs. 6 and 7B), time orientability, etc. (see Sachs and Wu 1977b, sec. 1.3). 
At the present state of our analysis we shall not assume such additional postulates; 
they will be discussed and, if necessary assumed, later on along with development of 
our presentation. 

Only after a metric model of space-time has been established, the question 
becomes meaningful: which are the largest and the smallest scales for which this 
models preserves its validity? The question about the upper limit of space-time 
scales is clearly connected with the cosmological extrapolation (in space and time); 
this problem will inevitably come back in the following pages of the present book. 
The question about the lower limit of space-time scales is one of the key problems 
of the present subquantum physics (see remarks in chapt. 1, sec. 7, E). In the view 
of Hawking and Ellis (1973, p. 57), there is empirical evidence that space -time has 
a structure of a smooth manifold down to the scales of 107-!® cm. The standard 
cosmological model suggests that this limit can be further lowered down to the Planck 
length, i.e., to scales 10~*° cm (see, for instance, Hawking and Ellis 1973, pp. 362-364; 
Misner et al. 1973, pp. 1196-1203; Demianski 1985, pp. 281-283). Doubtlessly, these 
problems belong to the most fascinating questions of contemporary physics. 


3. Space-Time of Special Relativity 


The pair (M,7), where M is a four dimensional manifold and 7 the Minkowski metric 
on M (see, sec. 1), is a space-time model in special relativity. This theory was created 
by Einstein in 1905, and it was Herman Minkowski to whom the theory owes its 
transparent geometric form (Minkowski 1907, 1908a, b; the last work is often quoted). 
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There is a basis (ea) = (€0,ex) in which the components of the Minkowski metric 
assume the form nog = diag(—1,1,1,1). This basis will be called pseudoorthogonal. 
The coordinate with the index zero is interpreted as time coordinate, the remaining 
ones as space coordinates?* 

The group of linear transformations 


P: (€) 9 a%eq + BF, 
where a% and Of are constants, such that 


a" uNap ao = Nv 


that is to say, preserving the bilinear form nagv%v", is called a Poincaré group or 
non-homogeneous Lorentz group and is usually-denoted by P(4). If b? = 0, the name 
a Lorentz or homogeneous Lorentz group is used; it is denoted by £(4). 

It can be shown that det{a%] = +1. If a > 0, the transformations, being ele- 
ments of the Lorentz (homogeneous) group, form the so-called ortochronous Lorentz 
subgroup; they preserve the time orientation (i.e., the direction of timelike and null 
vectors). 

If a% > 0 and det[a%] = +1, one obtains a subgroup, called a proper Lorentz 
group which is denoted by Lo. 

If a% > 0 and det{a%] = —1, the transformations contain time reflections 


be (€0, €k) e— L(—eo, ex) 


and space reflections 
8: (€9, ex) +> Leo, —ex); 


they do not form a subgroup (however, they do form a subgroup together with trans- 
formations for which det[a%] = +1). 

The full Lorentz group £(4) splits into four components: Lo, t£Lo, sLo, and stLo 
(see Choquet—Bruhat et al. 1982, p. 290; Carmeli 1977, chapt. 2). 

Special relativity is in fact a theory of invariants of the Lorentz group (or of the 
Poincaré group) together with a physical interpretation adjoined to it. The classi- 
fication of vectors into timelike, null and spacelike, and consequently null cones at 
each point of a space-time manifold (the so-called cone structure of space-time) is 
invariant with respect to transformations forming the Lorentz group. Curves to which 
the tangent vectors at all their points are timelike, null, or spacelike are called time- 
hike, null (lightlike), or spacelike, respectively. In the physical interpretation, timelike 
curves are treated as representing histories of particles (with a non-zero rest—mass) 
or of observers, and null curves as representing histories of photons (or more generally 
of zero rest-mass particles). Spacelike curves, in principle, could represent histories 


‘From now on we will stick to the following convention: Greek indices assume 0, 1, 2, 3 values, 
whereas Latin indices assume 1, 2, 3 values. 
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of tachions, i.e., of hypothetical particles moving faster than light. Since, however, 
no empirical results confirm such a hypothesis, in the rest of this book it will be 
systematically ignored.® 

In the theory of special relativity, geodesic curves (with respect to the connection 
defined by the Lorentz metric, see chapt. 4, sec. 6) are also identified as histories of 
freely falling particles of a non-zero rest-mass. This can be thought of as a version 
of the principle of inertia postulated by special relativity. 

The above interpretation of the cone structure of space-time, from the physical 
point of view, corresponds to its causal structure: it informs us on between which 
domains of space-time information flow (with the help of light signals or other signals 
slower than light) can take place (see Hawking and Ellis 1973, chapt. 7; Beem and 
Ehrich 1982, chapt. 2; the causal structure of the Minkowski space-time is presented 
in the first of these books on pp. 118-124). In this sense, the causal structure of space- 
time, as considered from a global point of view, has clear cosmological significance. 

With the help of the Minkowski metric the distance between two events in space— 
time can be defined in a natural way. However, let us note that this distance for two 
different events can be zero, provided that these two events can be joined by a light 
signal (see below, sec. 5, A). The existence of the Lorentz metric makes it possible 
to perform space and time measurements in different reference frames. This, in turn, 
allows one to construct a relativistic mechanics and, in further perspective, also a 
relativistic electrodynamics.® 

Everyday laboratory practice of high energy physicists gives a powerful empirical 
verification of special relativity. 


4. The Existence of the Lorentz Structure 


4.1. Theorem. A differential manifold M with an atlas {(U,,¢a)} (at least of the 
class C!) admits a (proper) Riemann structure if an only if M is paracompact.’ w 

Proof. (1) (proper) Riemann structure => M is paracompact (directly from 
the Stone theorem, see, for instance, Engelking 1975, pp. 336 and 342-343; even a 
stronger statement is true: every netrizable space is paracompact). 

(2) M is paracompact > (proper Riemann structure). First, let us recall an 
important notion. Let X be aC’ manifold. A family of C” functions @,, defined on 
X, is said to be a C” partition of unity on X if (a) the supports of functions O,, 


5In older literature, the term “world line” is used to denote the history of a particle or of an 
observer; this term was coined by Minkowski himself. For a stricter definition of “history” see 
chapt. 6, sec. 2. : 

®Let us note that the assumption of a finite velocity of light is contained in the causal structure. 
The numerical value of this velocity has to be taken from experimental results. 

7Reminder: a topological Hausdorff space X is paracompact if into each its open covering {U;} 
a locally finite covering {V;} can be mscrtbed. A covering {V,} of a topological space X is locally 
finite if for every z € X there is a neighbourhood having a non-empty intersection only with a finite 
number of sets V; € {V;}. Every compact set is paracompact but not vice versa. 
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supp O,, are compact, (b) for every x € X, there is a finite number of functions O; 
such that O, # 0, (c) 53, Ox(z) = 1 for each « € X. Further, let {(Va,Pa)} be an 
atlas on X. A partition of unity on X is said to be subordinated to the atlas {(Va, pa} 
if, for every k, there exists V, such that supp O, € Va. Now, let us come back to 
the proof. Let @; be a partition of unity subordinated to the atlas {(U,¢.)} on M. 
Further, let g; be a tensor of the type (0,2) on UW, having, in a certain basis (called a 
natural basis), the coordinates (g;)x: = 6,1. As it can be easily seen, a tensor 50g: 
of the type (0,2) defines a (proper) Riemann structure on M.® o 

Let us note that the argument contained in the above proof does not hold for a 
Lorentz structure. The sum )>, Ojg;, where g; are quadratic forms with the index 1, 
need not be quadratic forms with the index 1. Therefore, one must look for different 
criteria as far as the existence of a Lorentz structure on M is concerned. Before 
proving the theorem giving such a criterion, we must introduce a new concept. 

A function D, coordinating to every point z of a differential manifold X an h- 
dimensional subspace D(z) of the tangent space T,(M) is called an h-dimensional 
distribution on X.° If, for every point z € X, there exists a neighbourhood U of z 
and C’ vector fields V,,...,V, defined on YU such that, for every p € U, the vectors 
Vi(p),-.-,Va(p) form a basis of the subspace D(z), the distribution D(z) is said to 
be of the class C’. One-dimensional distribution on X is called a direction field on 
X. The concept of h-dimensional distribution turns out to be an efficient tool in the 
theory of partial differential equations of the first order (see Bishop and Goldberg 
1980, chapt. 3. 11-12). 

4.2. Theorem. A Lorentz structure exists on a paracompact differential man- 
ifold X (at least of the class C'), if and only if a continuous direction field exists 
on X. @ 

Proof. (1) Continuous direction field => Lorentz structure. X is paracompact, 
and consequently (on the strength of the preceding theorem) a (proper) Riemann 
structure g exists on X. Let u be a unit vector belonging to the direction field, i.e. 
u € D(x) C T,(X), where D(x) is a continuous one-dimensional distribution on X. 
Let us define 


Go(v, 0) = ge(v, v) — 2(ge(v, u))? 


with v € 7,(X). It can be easily checked that it is a Lorentz metric (e.g. in the 
four-dimensional case, by writing down the above expression in coordinates and sub- 
stittiing: u° = up 241, gae = Sag) 

(2) Lorentz structure => continuous direction field. Let g — as above — be a 
(proper) Riemann mrtric on X. One of the eingenvectors, to be found by solving the 


8From the definition of paracompactness it follows that a paracompact manifold can be covered 
by a denumerable set of local charts. In fact, the implication in the opposite direction is also true 
(see Geroch 1971, Appendix). 


°This concept has nothing in common with distribution as generalized functions (in the sense of 
Schwartz). 
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eigenvalue problem 

Gopu” = Agape, 
is timelike; it defines a continuous direction field on X. (See Choquet-Bruhat et al. 
1982, pp. 292-293; Geroch 1971). o 

Two comments should be given. Strictly cada in order to have a C” Lorentz 
structure on X one should havea C’ direction field on X: the existence of a continuous 
direction field on X implies only the existence of a continuous Lorentz structure on 
X. However, this is not a serious limitation since, as shown by Steenrod (1951, p. 25 
seq.), a continuous direction field on X can be approximated by a smooth direction 
field (in fact similar approximations are valid as far as continuous cross-sections of 
any vector bundles are concerned). 

Second, the existence of a continuous direction field on X is not only a necessary 
condition for the existence of a Lorentz structure on X, but also a Lorentz structure 
on X can always be chosen in such a way that the direction field in question be 
timelike in this structure. 

The existence of a direction field on a manifold is its topological property. As 
it is known from topology, every non-compact (but paracompact) manifold admits 
the existence of a continuous direction field; however, a compact manifold admits the 
existence of a continuous direction field if and only if its Euler—Poincaré characteristic 
vanishes. It turns out that — on the strength of one of a few possible Euler—Poincaré 
characteristic definitions — the existence of a Lorentz structure on compact manifolds 
is strictly connected with the structure of differential forms on these manifolds. Here 
is an outline of the idea. 

Let the space of all p-forms (of the class C*) on a manifold X be denoted by 
A(X) (see chapter 1, sec. 4). Since p-forms are antisymmetric, one must have p < n, 
where n = dim(X). The space A?(X) is closed with respect to the addition of p-forms 
and their multiplication by functions (of C* class). As it is well-known, an external 
product can be defined 


Az (AP(X), A'(X)) — AP™(X), 


allowing one to multiply a p-form a by a q-form £ to obtain a (p+ q)-form a A 3 
(more about differential forms on a manifold, see: Flanders 1963; Choquet-Bruhat 
et al. 1982, chapter 4, Bishop and Goldberg 1980, chapt. 4). 

The set A(X) of all forms of any order on X, together with an external product 
defined in it, is called an external algebra or a Grassman algebra. A Grassman algebra 
is a graded algebra, i.e. a set of sets {A?(X)}, indexed by the integers p = 0,1,...,n, 
together with an associated bilinear mapping A(X) x A(X) — A(X); in our case — 
an external product. An external differentation 


d: AP(X) — AP+(X) 


with the property dd = d? = 0, allows one to treat A(X) as a differential graded 
algebra. Let us note that the sets A?(X) are vector spaces (over R). 
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Let w € A?(X). w is said to be a closed form or a co-cycle, if dw = 0. w is called 
an ezact form or a co—boundary, if w = dO (of course O € AP~'(X)). 

Evidently, every exact form is closed (since d* = 0), but not vice versa. The study 
of closed forms that are not exact brings important information about topology of a 
given space in the following way. 

Let the vector space of closed p-forms on a manifold be denoted by Z?, and the 
vector space of exact p-forms on X by B?. Evidently, one has B? C Z?. The quotient 
space H? = Z?/B? is called a p-cohomology space of the manifold X. Since H? is 
an Abelian group with respect to addition it is also called a cohomology group of p> 
order of X. Equivalence classes of closed p-forms are elements of H?: two closed 
forms w; and wy are considered to be equivalent if they differ by an exact form, i.e., 
if Wy —- Wy >= dO. 

The dimension of the p'* order cohomology group of X is called a p Betti 
number of the manifold X and is denoted by b?. The sum x(X) = Yjeo(—1)78', 
where n = dim(X) is called an Euler—Poincaré characteristic of this manifold.'°° A 
compact differential manifold X can be equipped with a Lorentz structure if and only 
if y(X yen. 

Let us note that the vanishing of the Euler—Poincaré characteristic of a compact 
manifold X is a necessary and sufficient condition of the global existence of a Lorentz 
structure on it. A direction field, and consequently a Lorentz structure, locally always 
exist on X. 
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A. As is well-known, a (proper) Riemann metric on a differential manifold X defines, 
in a natural way, a (Riemann) distance function dpa: X x X — (0,00) on X, satisfying 
the usual axioms (including the triangle inequality). The topology determined by 
this function, the so-called metric topology, is identical with the original manifold 
topology (see chapt. 1, sec. 2). No Lorentz metric has these nice properties. In terms 
of the Lorentz structure, one can define a (Lorentz) distance function dp: X x X > 
(0, co] (which can assume infinite values!), but it satisfies axioms different from those 
satisfied by dp (in particular, it satisfies the so-called reversed triangle inequality). 
Any distance measured by dz, along non-timelike curves is, from definition, equal to 
zero. A topology determined by dy is essentially weaker than the manifold topology 
on X. Uniortunately, entering into details would require new concepts going beyond 
the scope of the present book; the interested reader should consult the book by Beem 
and Ehrlich (1981, chapters 1 and 3). 


'0The Betti numbers and the Euler-Poincaré characteristic of a given manifold can be also defined 
dually in terms of a triangulation of this manifold. Dual with respect to closed and exact forms are 
the so-called cycles and boundaries, respectively. In a similar manner one defines a homology group 


(or space) Hp as a quotient space Z,/Bp, where Z, is a vector space of cycles and By a vector space 
of boundaries (of the dimension p). 
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B. As we have seen, the existence of the Lorentz structure on a (paracompact) 
manifold is equivalent to the existence of a continuous direction field; moreover, the 
Lorentz structure can always be chosen so as to make the direction field timelike in this 
structure. This fact seems to have a deep cosmological significance; one can express 
it by saying that the existence of a Lorentz structure on a manifold is equivalent to 
the existence of a local arrowless time on it. The existence of a continuous timelike 
direction field on a manifold could be interpreted as a geometric counterpart of the 
fact that each point of the manifold feels a lapse of a local time. This lapse, however, 
is determined by a direction and not by a vector, i.e. two possible arrows along the 
same direction (the past and the future) are equivalent (the present structure does not 
distinguish between them). It is truly an arrowless time. And it is also a local time: 
a timelike direction field, besides the rather tolerant requirement of continuity, does 
not impose any global conditions; in particular, it provides no conceptual tools which 
could serve to compare “local times” at different points of the space time manifold. 

To sum up: a paracompact differential manifold can carry a Lorentz structure if 
any only if a continuous direction field exists on it, and this direction field, after the 
imposing a Lorentz structure on the manifold, can be interpreted as a local arrowless 


time (see Heller 1986, chapt. 4). 


C. If one agrees that without a Lorentz structure physics cannot be done on a 
manifold (see introduction to the present section), one must admit that the existence 
of at least a local arrowless time is a condition for the possibility of physics. A 
similar idea was expressed by C. von Weizsacker (1972). In his opinion, science 
establishes laws which explain experiments performed in the past, but are verified 
(or falsified) by predicting outcomes of future experiments, and by comparing them 
with the results of experiments when they are already present results; in this sense 
time is a precondition of physics. If one remembers that, from the methodological 
point of view, retrodictions are as important for science as are predictions (although 
psychologically predictions play doubtlessly a more important role), one is entitled to 
assert that the existence of an (at least) arrowless time is a precondition for physics 


(see Heller 1986, chapt. 4). 


D. Although a compact differential manifold with the vanishing Euler—Poincaré 
characteristic admits the existence of a Lorentz structure, it leads to some patholo- 
gies when considered as a model of space-time. Bass and Witten (1957) have shown 
that a four-dimensional compact manifold with the vanishing Euler-Poincare char- 
acteristic cannot be simply connected.'! Moreover, it turns out that in every such 
manifold at least one closed timelike curve must exist (see Beem and Ehrlich 1981, 
p. 23). Physically, this would mean the existence of a closed history of an observer 
or of a non-zero rest-mass particle (i.e., of a history which would endlessly repeat 
itself again and again). It can hardly be seen how one could reasonably make this 
consistent with requirements of the macroscopic causality (for interesting remarks 


1l}ts first Betti number must be different from zero. 
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concerning this problem see Hawking 1971, p. 396; Hawking and Ellis 1973, p. 189). 
No four-dimensional compact manifold equipped with a Lorentz structure can be 
simply connected. However, such a manifold could be treated as a non-compact 
Lorentz manifold in which some points have been identified. More precisely, for any 
n-dimensional manifold Xo, there is a unique simply connected manifold X covering 
the manifold Xo; X is called a universal covering of Xo or a manifold universally 
covering the manifold Xo (for definition see Spanier 1966, chapt. 2; Rychtmyer 1981, 
chapt. 24). Therefore, from the point of view of local properties, instead of a four— 
dimensional compact manifold one could take its universal covering as a model of 
the physical space-time. This covering is non-compact and simply connected (see 
Hawking and Ellis 1973, p. 190). 


E. Now, we are ready to take up the question posed in chapter 1 (sec. 7, A) con- 
cerning the mutual relationship between a coordinate system and a reference frame. 
Coordinate system is a mathematical concept which is equivalent to that of a local 
chart belonging to an atlas that defines a differential manifold structure. A reference 
frame is a physical counterpart of a coordinate system; however, not every trans- 
formation of coordinates would be treated by a physicist as a transition to a new 
coordinate system. For instance, an affine transformation of the form xz + az + 6, 
a,b € R (changing to new units and new zero point on a scale) or the transition 
from Cartesian to polar coordinates would not be considered as going to a new ref- 
erence frame. On the other hand, “rotating coordinates” with respect to previous 
ones would be regarded as being connected with a new frame of reference. Clearly, 
the difference between these two classes of coordinate transformations consists in the 
fact that transformations of the former class are independent of time whereas the 
transformation of the latter class do depend on time. Therefore, the definition of a 
reference frame must contain: (a) a definition of rest (with respect to this reference 
frame) and simultaneity; (b) a definition of the metric structure of time (t) together 
with a unit of time, and (c) a definition of the metric structure of instantaneous 
spaces (¢ = const) together with a distance unit. Owing to these definitions one can 
meaningfully speak separately of time and separately of space, but only with respect 
to a given reference frame (see Dixon 1978, chapt. 1, sec. 2). 


F. Clarke (1970) has shown that every Lorentz manifold can be smoothly em- 
bedded into a flat (pseudo)Euclidean space of suitably many dimensions; this em- 
bedding can be of an arbitrarily high differentiability class. In particular, every 
compact four-dimensional Lorentz manifold can be smoothly embedded into the 48- 
dimensional pseudo—Euclidean space with the index 2 (2 minuses and 46 pluses), and 
every non-compact four-dimensional Lorentz manifold can be smoothly embedded 
into the 89 dimensional Euclidean space with the index 2 (2 minuses and 87 pluses). 


Bibliographical Note. From among books on differential geometry, enumerated in the 
Bibliographical Note after chapter 1, only Choquet—Bruhat et al. (1982), and Bishop and Goldberg 
(1980) deal extensively with Lorentz manifolds. The monographs by Hawking and Ellis (1973) and 
by Beem and Ehrlich (1981) are entirely devoted to Lorentz manifolds and their applications to 
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space-time geometry (see also Heller 1981b, 1986 chapt. 3 and 4). The book by O’Neill (1983) 
should be especially recommended; it develops, step by step, the geometry of spaces with Lorentz 
metrics, and discusses they use in both special and general relativity. This book can also be treated 
as a kind of introduction to more advanced texts (like, for example, Hawking and Ellis (1973) or 
Beem and Ehrlich (1981)). 

The reader is urged to make a closer acquaintance with the theory of special relativity. Its basic 
ideas can be found in every modern textbook of physics. A good introduction, with a penetrating 
discussion of the conceptual framework of this theory, is Taylor and Wheeler (1966). First chapters 
of Schutz (1985) are equally good but much shorter. There exist books on special relativity of the 
monographic character, for example: Aharoni (1959), Synge (1965), Dixon (1978). The history of 
special relativity can be found in Miller (1981); this book contains also a new English translation of 
Einstein’s original paper (1905) and its extensive analysis. 
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Chapter 3 


Method of Fibre Bundles 


0. Introduction 


Fibre bundles functioned in physics for quite a time before their strict definition was 
formulated by mathematicians. For instance, as it is well-known, the Lagrangian 
of a dynamical system is a real function of coordinates and velocities of particles 
constituting this system. In other words, if M is an n-dimensional configuration 
space of a given system, the Lagrangian is defined on the space of all coordinates and 
(generalized) velocities, that is, on all tangent vectors to M at all points z of M. Such 
a space is called a tangent bundle over M, denoted by T(M), and it evidently has 
2n dimensions. A tangent bundle 7(M) is in fact a union of tangent spaces 7,(M) to 
the manifold M at all its points z € M. The space T(M) can be given the structure 
of a differential manifold; T(M) equipped with this structure is called a bundle space, 
and M its base space. A dynamical system can also be investigated with the help of 
the Hamilton method. The Hamiltonian of the system is a function of coordinates 
and (generalized) momenta. Since momenta are linear combinations of (generalized) 
velocities, one says that the Hamiltonian is defined on a cotangent bundle over the 
configuration space M, where the cotangent bundle over M, denoted by 7,7(M), is to 
be understood as a union of dual spaces 7,*(M) to the spaces 7,(M) for all z € M. 
The space T*(M) can also be equipped with the structure of a differential manifold. 
(For classical mechanics in the language of bundles, see Abraham and Marsden 1979, 
Thirring 1978, Arnold 1974.) 

The Kaluza~Klein theory can serve as another, perhaps even more sophisticated, 
example of the functioning of fibre bundles in physics before they were discovered by 
mathematicians. Kaluza and Klein, yet in 1921, aimed at unifying gravitational 
and electromagnetic interactions within the framework of a five~dimensional Rie- 
mann space on which a one-parameter isometry group was acting (see, for instance, 
Bergman 1976, chapters 17 and 18). Ez post it has turned out that the space, pos- 
tulated by Kaluza and Klein, is the so-called circle bundle, and the electromagnetic 
potentials, in their approach, are connection forms in it (see, Trautman 1980). 

After the Second World War the theory of fibre bundless began gradually to 
dominate differential geometry and many chapters of topology as well. Today, it con- 
stitutes a standard method of the so-called global analysis. After passing through 
the hands of mathematicians and obtaining a precise, fully formalized form, the fi- 
bre bundle method has come back to physics. First, as an efficient tool not only 
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introducing an order into individual physical theories (such as classical mechanics, 
see above), but also showing how some physical theories can be arranged into evolu- 
tionary chains revealing a certain logic of their development (see Appendix). Later 
on, the theory of fibre bundles was recognized as a suitable mathematical apparatus 
allowing one to geometrize gauge theories (see, e.g. Drechsler and Mayer 1977, Nash 
and Sen 1983). It has turned out that the case of the Kaluza~Klein theory was not 
an exception but rather a signal of a more general rule: gauge potentials, so-called 
by physicists, are essentially connection forms in principal fibre bundles. If one takes 
into account the fact that the unification of the electromagnetic and lepton (weak) 
interactions, achieved by Weinberg and Salam, has been obtained within the frame- 
work of the gauge scheme, and that there are strong reasons to believe that the same 
scheme will work in unifying strong (hadron) interaction with the electroweak one 
(the so called grand unification), and perhaps also with gravity (superunification), 
one can hardly overestimate the role of fibre bundles in contemporary physics (see, 
for instance, Taylor 1978, Ross 1981). 


Today it is also known that grand unification and, even more so, superunification 
can work only in an environment of extremely high energies. One estimates that the 
energies needed for grand unification and superunification are of order of 10!4 GeV 
and 10!° GeV respectively. Such a high energy can be supplied only by the Universe 
during its very early evolution. According to the standard cosmological model the 
energy necessary for grand unification was obtainable in the Universe 10~* s after 
the initial singularity, and the energy necessary for superunification 10~*4 s after the 
initial singularity. In this way, the unification of physics seems to be inseparably 
connected with cosmology. 


Einstein’s hope for unifying physics in a geometric framework was not in vain. 
It has only turned out that the traditional space time geometry is too poor for this 
end. Physical events cannot be mere points devoid of any structure. The heart of 
the matter consists in that that such a structure can be given to them by considering 
the set of suitable geometric objects (tangent spaces, all possible frames, an so on) at 
a given point of space-time, i.e. by considering a suitable fibre over this space time 
point. Collecting together all such fibres (over all points of space time) one obtains 
a corresponding fibre bundle with space-time as its base manifold. 


The reasons presented above sufficiently justify our aim to present general rel- 
ativity, with the perspective of its future cosmological applications, from the very 
beginning in terms of fibre bundles. We shall do that in the next chapter; in the 
present chapter we focus on the fibre bundle method itself. Section | introduces the 
concept of fibre bundles, and sec. 2 specifies the principal and associated fibre bun- 
dles. In sec. 3, the reader will find an outline of the theory of connections in fibre 
bundles; because of its importance in physics this theory is presented in a greater de- 
tail. In sec. 4, the geometry of a principal fibre bundle (connection and its curvature, 
structure equations, etc.) is constructed, and in sec. 5 this geometry is “projected 
down” to the base manifold of the bundle. The usual Comments and Remarks are 
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replaced in this chapter by a concise presentation of some applications of the fibre 
bundle method in gauge theories in physics. 


1. Fibre Bundles 


The fibre bundle concept is a generalisation of the Cartesian product. For instance, 
a cylinder can be presented as S! x I, where S! is a one-sphere and Z C R!, but 
something like that cannot be done with respect to a Mobius strip. However, it is not 
difficult to see that Mobius strip is “locally” a Cartesian product: if U is a proper 
subset of S', then U x T is “a part” of a Mobius strip. We say that Mobius strip is 
a fibre bundle over S! with Z as a typical fibre. This situation can be generalised in 
the following chain of definitions. 

1.1. Definition. The triple (€, 8,7), where € and B are topological spaces, and 
mw: € —» B a continuous surjective mapping called projection, is said to be a bundle. 

1.2. Example. Let A and B be topological spaces. The projection 74: Ax B = 
A is defined by m4(r4,7g) = t4, 24 € A,r € B. (Ax B,A,7,4) is a bundle. 

A bundle which is a Cartesian product (like that of example 1.2) is called a 
trivial (or product) bundle. 

1.3. Definition. A fibre bundle is a set of five elements €(B) = (€,B,2,G,F), 
where (€,8,7) is a bundle, € is called a bundle space (or a total space), B is a base 
(or a base space). It is assumed that B has an open covering {U;;j € 7}. The set 
a\(x) = F,, x € B, is called a fibre over x. It is assumed that fibres F,, for every 
xz € B, are homeomorphic to the topological space F, called a typical fibre. G is a 
group acting on F to the left; it is called a structural group of the fibre bundle. The 
following conditions are supposed to be satisfied: 

(1) The fibre bundle is locally trivial, i.e., there exists a homeomorphism 


yj: x (U;) Uj; x F 
given by ~;(p) = ((p), 6;(p)), p € m—1(U;), where 
$3 |p. = Pixs ix — F 


is a surjective homeomorphism, and the diagram 


is commutative. 
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(2) The consistency condition holds, i.e., for every x € U, NU, and every j,k € J, 
the homeomorphic mapping 


Pk,x © jn: FF 


is an element of the structural group G (see fig. 3.1). 


-1 
A $10 $2.2 €EG 
/ 


Fig. 3.1. Compatibility condition. 


(3) The mappings 
Vik: U; N U, > G, 


defined by 7;4(z) = 5,299; ws xr € U;NU;,, are continuous; 7, are called transition 

functions. 

1.4. Definition. Two bundles (€;,B;,7;), 1 = 1,2, are said to be equivalent if 
there exist mappings F': €; — € and f:B, — B, that preserve the bundle structure, 
i.e., such that the diagram 


B, a 


is commutative. The pair (F, f) is said to be a bundle morphism. The equivalence of 
bundles plays a similar role in the fibre bundle theory as the equivalence of atlases does 
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in the theory of manifolds! (see chapt. 1, sec. 2): two fibre bundles (&;, B;, 7, Gi, cas 
i = 1,2, are considered to be equivalent if treated locally, as the bundles (U; x 
F;, U;, m;) are equivalent; additionally one assumes that U; belong to the open covering 
{U;; 7 € J} of B; and {U;} is consistent with the fibre bundle structure, i.e., it satisfies 
conditions (1) and (2) of the definition of fibre bundle. 

The sets {U;,;} appearing in the definition of a fibre bundle (condition (1)) 
are called local trivializations of the bundle. If the structural group G has only one 
element, the bundle is called trivializable. 

1.5. Definition. A fibre bundle €(B) = (€,B,7,G,F) is said to be a differential 
fibre bundle (of the class C*) if €, B and ¥ are differential manifolds (of the class 
C*), the sets U; covering B coincide with domains of local charts on B as a manifold, 
G is a Lie group, and mappings m and +; are differentiable (of the class C*). In the 
following, without saying this directly, we shall consider fibre bundles of the class C®. 

1.6. Example. Let S? be a two-sphere with a chosen orientation. A pair of 
unit orthogonal vectors (€;,€2),, tangent to S? at z € S?, is called a diad. The set of 
all diads having orientation consistent with that of S* is denoted by P. Let us define 
the projection 7:P — S* by m(e1,e€2)2 = x. If (e1,€2)2 is a diad at a point x € S?, 
then the transformation 

e, = e1 cosy + e2siny, 
ey = —e,siny + e2,cosy 


gives another diad at the same point. Such transformations are elements of the 
group SO(2) which turns out to be the structural group of the bundle P(S?) = 
(P,S?,x,SO(2)). The typical fibre F is identical with the group SO(2) treated as 
a manifold (see below, sec. 2). On the strength of the Brouwer theorem about the 
impossibility to smoothly comb a vector field on a sphere of an even dimension, the 
fibre bundle P(S?) is not trivial. (See Trautman 1978, pp. 6-7.) 

1.7. Example. Mobius strip (see Choquet-Bruhat et al. 1982, pp. 126-127.) 

If a fibre bundle €(B) with a structural group G, after replacing G by its subgroup 
G1, leads to the equivalent bundle structure, the structural group G is said to be 
reducible to the subgroup G , or the bundle €(B) with the structural group G, is 
said to be a reduction of the bundle €(B) with the structural group G. If such 
a reduction occurs, then there exists a family of local trivializatons such that the 
transition functions 7,; corresponding to them assume values in G, (see chapt. 4, 


sec, 5). 


‘Bundles together with bundle morphisms define the category of bundles. By suitably generalizing 
the morphism concept (by taking into account the structure induced by the existence of a typical 


fibre and of a structural group) one can define the category of fibre bundles (see Sulanke and Wintgen 
1972). 
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2. Principal and Associated Fibre Bundles 


2.1. Definition. A fibre bundle €(B) = (€,B,7,G,F), the structural group of which, 
G, acts to the left on the typical fibre F, i.e., Ly(f) = 9f,9 € G, f € F, and such that 
F is identical with G (understood as a manifold), is called a principal fibre bundle. 
2.2. Example. Let p, be a basis in 7,(X"), i.e., pz = (e:), 2 = 1,...,n, where 
e; are linearly independent vectors; these vectors, in turn, can be expressed as linear 
combinations of vectors é; which form another basis in 7,(X). Of course, one has 


e;=a'é;, (a¥;) =a € GL(R”). 


Thus, there is a bijection between the set of all bases in 7,(X") and the group GL(R”). 
The group G£(R”) is a Lie group and, as a manifold, it can be identified with an open 
subset in R”. Let us define F(X”) to be the set of pairs (x, pz), for every x € X", 
and the projection t(z,p:) = zr. In this way, we have obtained the fibre bundle 
F(X") = (F(X”), X", 7, GL(R”)) which is called the fibre bundle of linear frames. 

2.3. Example. The fibre bundle of diads over a sphere is a principal fibre bundle 
(see example 1.6). 


Fig. 3.2. Right action of the structural group G on a fibre G, 
of a principal fibre bundle. 


Let €(X) = (€,X,7,G) bea principal fibre bundle. Let further p € G, = 77'(z), 
z € U;, with U; belonging to an open covering of X which defines the fibre bundle 
structure. Let us remember that ¢;2:G: — G, and let g, = ¢i,2(p). We define the 
right action IR,, g € G, of the structural group G on the fibre bundle in the following 
way (see fig. 3.2) 


(IR,p)i = $72 © Ry © bic = $;(Rogi) = $52(9:9)- 
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The definition is consistent with the fibre bundle structure, i.e., for p € x -*(U;N 
U;), one has (IR,p); = (IR,p); (the reader is urged to prove this as an exercise, see 
Choquet-Bruhat et al. 1982, p. 130). One can easily see that IR, acts (transitively) 
on a single fibre, i.e., (p € Gz) > (Rap € Gz), « = 7(p). 

2.4. Definition. A fibre bundle such that its typical fibre is a vector space 
and its structural group is a linear group is said to be a vector fibre bundle. A 
vector fibre bundle €(X) = (€, X,2,F,G) is said to be associated with the principal 
fibre bundle P(X) = (P, X,7,G) if there is a representation? p:G — IL(F,F) of 
the structural group G in the space F such that the transition functions +}; of the 
associated bundle €(X) are images (under p) of the corresponding transition functions 
of the principal bundle P(X), i.e., yj, = p(7%j), or more precisely: Let {U;, pi} be local 
trivializations of the principal bundle P(X), and {U;,y;} local trivializations of the 
associated bundle €(X). In such a case the transition functions +;; of the principal 


bundle P(X) are given by 


ese U;N U; — G, 
ri %;(7) =%e.oW%. €G, 


where %&~:G, — G, and the transition functions V5 of the associated bundle €(X) 
are given by 

ij(@) = ¢52° O72 = p(yu(z)) € L(F,F), 
where ¢;,¢: 77 1(z) > F. 

Sometimes one stresses directly that a given bundle €(X) is associated with a 
principal bundle P(X) via the representation p of the structural group. Having a 
vector bundle €(X)) it is always possible to construct effectively a fibre bundle P(X) 
with which €(X) is associated (see Choquet-Bruhat et al. 1982, p. 368). 

2.5. Example. Let 7(X") be the space of pairs (r, vz),  € X”, vz € T(X"), 
and let the projection be defined as m(x,v,) = z. Let us further suppose that 
{(U;,j)} is an atlas on the manifold X”. Let us consider two local charts (Uj, 1) 
and (U2, %2), and z € U; N U2, v € T(X"). We have 


Vie! Vero eR" 
bhai er Ve ER” 


The transformation V; — V2 is given by 
Yi2°,,: R” —> R” € GL(R"). 


In this way a fibre bundle T(X") = (7(X"), X",m,GL(R"), R") has been de- 
fined; it is called a tangent bundle to the manifold X”. Since there is a representation 


?We remind that a representation of a group G ona space F is a homeomorphism G — IL(F, F), 
where IL(¥, F) denotes the set of linear transformations f: F > F. 
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p:GL(R”) > IL(R",R”) the tangent bundle is associated with the bundle of frames 
F (BR (F(X) 2x, GE(R")). 

A C” mapping f:B — € such that 20 f = id is called a C’ cross-section of the 
bundle (€,B,7). AC’ field of contravariant vectors on a manifold X” can be defined 
as a C’ cross-section of the tangent bundle T(X”). 

2.6. Example. Let 7*(X") denote the space of pairs (z,w,), r € X®*, 
we € T,7(X"), and let the projection be defined as 72(z,w,) = xz. The fibre bun- 
dle T7(X") = (T7(X"), X",7,GL(R”), R") is called the cotangent bundle to the 
manifold X”. It is also associated with the frame bundle. A cross-section of the 
cotangent bundle is a field of covariant vectors or a 1-form on X”. 

2.7. Example. By generalising examples 2.5 and 2.6 we define a tensor fibre 
bundle of the type (q,p) over a manifold X” as T4,(X") = (T?,(X"), X",7,G,F), 


where 


T*,(X") is the space of pairs (z,t4,(x)),z € X", 
t%(z) € @77,(X") @ T7(X"), 
mona), 


G=@'*9GC(R"), F = @'R"@’ R™. 


It is a fibre associated with the frame bundle. A cross-section of a tensor bundle 
of the type (q,p) is a tensor field of the type (q,p) on the manifold X”. 

Tensor bundles over a paracompact manifold always admit cross-sections (see 
Choquet-Bruhat et al. 1982, pp. 385-386). 

The concept of reduction of a principal fibre bundle, introduced in the preceding 
section, can be reformulated in the following way. A principal fibre bundle €(.X) = 
(€,X,7,G) is said to be reducible to a principal fibre bundle €&(X) = (&1, X, 71,91), 
where €, C €, and G is a subgroup of G, if there is an injective mapping f:€ 7 € 
which is a morphism of the bundles €,(X) and €(X) and which commutes with 
the action of the group Gj, i.e., f(IR,p) = IR, f(p), 9 € G, for every p € & and 
rf(p) = m(p), for every p € &. 


3. Connections in a Principal Fibre Bundle 


Connection in a principal fibre bundle is a structure of crucial importance for our 
further considerations. Therefore, we will give here three (equivalent) definitions of 
this structure, beginning from the one which seems to be intuitively most clear and 
ending with the most operative one. The third definition gives a link between the 
remaining two.? 

3.la. Definition. Let P(X) = (P,X,7,G) be a principal fibre bundle. A 
mapping o,:7,(X) — 7,(P), for every p € P, m(p) = x € X, is said to be a 
connection in P(z), if 


3Here I closely follow Choquet—Bruhat et al. 1982, pp. 357-361. 
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(1) 9, is linear, 

(Q)\en co, —idion 7.(4)), 

(3) op depends differentiably on p, 
(4) on,p = FR’o,, for every g € G. 

The above conditions are chosen in such a way that the parallel transport made 
with the help of this connection is consistent with the structure of the fibre bundle 
as a differentiable and principal fibre. If R! > IJ — X is a curve t+ C(t), t € TZ, 
in X passing through a point to = 7(po), po € P, the parallel transport along the 
curve C’ is given by the curve C:R, >I P which is defined through the following 
relationships*, 


dC (t) dC (t) 


~ dt ar 
C(0)=po, C(t) =p. 
The curve C(t) in P is called the horizontal lift of the curve C(t) in X. A 


horizontal lift C(t) of a curve C(t) determines a correspondence between every two 
fibres along C(t), that is to say, it connects the bundle into an entirety. Condition (4) 
guarantees that two different horizontal lifts C, and C2 of the same curve C in aXe, 
to two different points p; and p2 of the same fibre x~!(x) over the point x € X, are 
connected with each other with the help of the right action JR, of the structural group 
on the fibre 7~!(x) in such a way that JR, transforms p; into py. 

Let us define a space of horizontal vectors (or a horizontal space) at a point 


peéPas 
Hy = {o,(T-(X)}, 2 = m(p). 


A space of vertical vectors (or a vertical space) can be defined, independently of any 
connection, as the set of vectors which are tangent to a fibre, i.e., 


Vv, ={vE€7,(P): r'v=0}, peEP. 


It is easily seen that the tangent space 7,(P) to the bundle at a point p can be 
decomposed into a direct sum of the horizontal and vertical spaces: 


T(P) = Hp ® Vp, 
1.e., for every v € 7,(P), v = vy + vy, with vy € Hy, vy € Vp. Evidently, one has 
mH, =T(X) and rV,=0, 2=n(p), 


The first of these relations defines the isomorphism of the horizontal space (gc 
and the tangent space 7,(X), and suggests that H, can be used to an alternative 


definition of the connection. 


‘In the following, magnitudes defined in a bundle will be denoted by the same symbols as the 
corresponding magnitudes in the base manifold but equipped with an additional tilde above the 
corresponding symbol. 
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3.1b. Definition. A connection in a principal fibre bundle P(X) = (P, X,7,G) 

is a field of vector spaces H, C T,(P) satisfying the following conditions: 
(1) the mapping 7’: H, — T,(X), z = 7(p), is an isomorphism, 

(2) the field H, depends differentiably on p, 

(3) Hr,p = RH. 

The transition to the third definition is the following. A correspondence between 
the spaces H, and V, suggests that the space V, can be also used to define connection. 
This suggestion is correct. It turns out that vertical spaces V, are isomorphic with 
the tangent space 7.(G) to the structural Lie group (as a manifold) at its neutral 
element e, i.e., with the Lie algebra G of the structural group G. It is this isomorphism 
that leads to the third definition. 

As is well-known, the space of all left invariant vector fields on a Lie group G 
(equipped with the Lie bracket operation) is the Lie algebra G of the Lie group G. 
The Lie algebra G (as a vector space) of a Lie group G can be identified with a 
tangent space to the Lie group G at its neutral element e through the mapping 
G>VrY Ve) € 7.(G). On the other hand, to any element J of the Lie algebra G 
of the structural group G there corresponds a vector field A(¥) on P in the following 
way. If J € G, then exp(Vt) is a one-parameter subgroup of G which acts on P to the 
right. This allows us to draw the curve py = MRexpiot)(p) through every point p € P. 
Now, for any real-valued function f on P we define 


d 
MMA) = FF) |_. 
As can be easily seen, the vector A(V), is tangent to the fibre m~'(a(p)) at p; it 
is, therefore, a vertical vector, \(v), € V,. The mapping \:G — Y, establishes a 
canonical isomorphism between the Lie algebra G of the structural group G and a 
vertical space V,, for every p € P. The vector field A(J) on P is called a fundamental 
vector field. 
Now, a Lie algebra valued 1-form on P, 


a) 4G, 


can be defined in the following way: if v € 7,(P) then w,(v) = 0 € G such that 

A(9) = vy € Vy. Or, in other words, one demands that w,(v) should be equal to the 

vertical component of the vector v which, in turn, by the canonical isomorphism is 

identified with an element of the Lie algebra. The ]-form w, called a connection form, 

determines a connection in the principal fibre bundle according to the following. 
3.1c. Definition. A connection in a principal fibre bundle P(X) = (P, X,7,G) 

is a 1-form w on P satisfying the following conditions: 

(1) For every v € V,, wp(v) = ¥ such that between v and VJ there is a canonical 
isomorphism, 

(2) w, depends differentiably on p, 

(3) wr,pUR,v) = Ad(g~")w,(v). 
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Let us notice that the set of all vectors v € J,(P) for which w,(p) = 0 determines 
the horizontal space H,. This fact allows one immediately to check the equivalence 
of the second and third definitions of connections. Condition (3) of the above def- 
inition is analogous to conditions (4) and (5) of the first and the second definition, 
correspondingly, but it is less transparent and requires a few words of comment. 

The restriction of the form w to a single fibre 9, =.1~'(x) gives 


w(vv)|o =UVy, we T (Gz) = V5 


If we perform the mapping s:G — G, such that s(h) = p if and only if p = Rapo, 
where A € G and po is a fixed point in G,, the form w|g, can be regarded as the 
Maurer—Cartan form of the Lie group G.° It can be proved that for a Maurer—Cartan 
form one has 


R5w(v) = Ad(g™)w(v), 


where Ad(g); g € G is the so-called adjoint representation of the group G; it is a 
representation in the Lie algebra G of this group (see Choquet-Bruhat et al. 1982, 
pp. 168-169). 

Keeping in mind that the action JR, preserves the decomposition of a vector 
v € T,(M) into its horizontal component vy and its vertical component vy (see 
condition (3) of the second definition of connection), it is easy to compute the pull 


back of w by JR, 
IR3(w)v = wUIRjv) = wR vy + IRyvv) = wR, vv). 


By treating w as a Maurer—Cartan form on G, one immediately obtains condi- 
tion (3) of the third definition of connection. 

To conclude this section, let us define parallel transport in an associated fibre 
bundle. 

3.2. Definition. Let €(X) = (€,X,7,G,F) be an associated fibre bundle with 
a principal fibre bundle P(X) = (P, X,7,G) by the representation p:G > IL(F,F). 
The parallel transport of an element v; € €,7,(v;) = 21, along a curve C from the 
point zr; € X toa point r2 € X, gives — from the definition — the element v2 € €, 
™(v2) = x2 such that 


$i(v2) = p(Vi(pe) 0 Vi(pi)~*) 4; (v1), 


where pz € P is obtained by the parallel transport of the element p; € P, (pi) = 21, 
in the principal fibre bundle P(X), along the curve C' from the point x, to the point 
rz. All other symbols are the same as in sections 1 and 2 with the exception that in 
symbols ¢;,, and W;,,, the index x was omitted for simplicity. 


*Let us remember that the Maurer-Cartan form of the Lie group G is a form w, defined on G, 
with values in the Lie algebra G of the Lie group G, such that w(vg) = 0, where 3 = Esty; € T.G. 
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The parallel transport in an associated fibre bundle, defined in the above way, 
depends on neither the choice of p, in a fibre ~1(z1) nor the choice of a trivialization 


of the fibre bundle. 


4. Geometry of Principal Fibre Bundles 


As we have seen, connection in a principal fibre bundle P(X) is a 1-form on P 
with values in a vector space, namely in the Lie algebra G of the structural group G. 
Let us recall that a p-form on a differential manifold X with values in a vector space 
Y is defined as 

P=" Bea, 
where y* is an ordinary p-form on X (with scalar values), and e, are basis vectors 
in Y. It is assumed that V is a real and finite-dimensional space. 

An ezternal derivative of a p-form ¢ is defined to be 


dp = dp” @ ea. 


It can be easily checked that this definition does not depend on the choice of a 
basis in V. Moreover, if V is a Lie algebra with the bracket operation [va, vg] between 
every pair of vectors v.,vg € V, then the bracket operation can be extended to forms 
assuming their values in V. Let y = y* @eq and w = W* ® eg be p-forms defined on 
X with values in Y. From the definition, we have 


[p, ¥] = (v" Ao") ® [eas eg], 


where A is the external product of forms. This definition is independent of the choice 
of a basis in V. 

Let us consider a principal fibre bundle P(X) = (P, X,7,G) with a connection 
in it defined as a 1-form w. Let 


AP) — HH, 


be a mapping defined by v + vy,v € 7,(P). Let also y = y* @e, be an r—-form with 
values in a vector space V having a basis eg. An ezternal covariant derivative of an 
r-form ¢ is defined to be 


Deo(v;,..., Ur, Vr41) = de(hv;,..., hv,, hors), 
where, as usual, dp = dyp* @ e€g. 


An r-form a on P(X) with values in a vector space V is called horizontal when 
a(v;,..-,Ur) = 0, if at least one of the vectors v;,...,v, is vertical. 
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4.1. Definition. An r—-form a on P(X) with values in a vector space V is said 
to be a form of the type (p,V), where p is the representation of the structural group 
G in the space V, p:G — IL(V,V), if 


IRya = p(g7 Ja. 


If this relation takes place the form a is said to be equivariant under the action of 
IR, with respect to the representation p. If additionally the r-form a is horizontal, 
it is called a tensorial form of the type (p,V). Let us note that the type (p,V) of 
a tensorial form informs us about the character of the tensor field in which, in the 
general case, the given form assumes its values; namely, the dimension of the space 
V is equal to the number of factors in the tensor product of tangent and cotangent 
spaces defining the given tensor, and the representation p(g~') conveys information 
about the properties of this tensor. 

4.2. Definition. An external covariant derivative of a connection one-form w 
on P(X) is said to be a curvature form of the connection w and is denoted by 2, i.e., 
Q = Dw. Of course, 2 is a 2-form on P(X) with values in the Lie algebra G. 

By using the relationship IRjw = Ad(g~') ow (see sec. 3), one shows that the 
curvature form is a tensorial form of the type (Ad, G). 

If 2 = DN = 0 the connection is said to be flat. 

4.3. Theorem. A connection form can be expressed with the help of the follow- 
ing equation, called the Cartan or structural equation (or Cartan structural equation)® 


D(u, v) = dw(u,v) + [w(u),w(v)]. = 


Before proving this theorem let us recall a useful formula expressing the external 
derivative of a form 0, 


dO(u,v) = L,O(v) — L,O(u) — O([u, v}), 


where L,(v) is a Lie derivative of the vector v in the direction of the vector u. This 
formula can be proved by working it out in components. 


Proof. 
2(u,v) = Dw(u,v) = duw(hu, hv) = dw(u — uy,v — vy) 
= dw(u,v) — dw(u, vy) — dw(uy, v) + dw(uy, vy) 
= dw(u,v) — dw(uy, vy) — dw(uy, vy) — dw(uy, vy). 
The second term of the middle row, dw(u,vy), has been replaced by the term 


—dw(uy + uy,vy), and analogously, the third term in this row. Two middle terms 


in the last row are equal to zero. Indeed, let us consider, for example, the second of 
them, 


dw(un,vH) = Ly w(vy) — Ly, w(vy) — wluy, vg]. 


°Sometimes the last term on the right hand side of the Cartan equation is multiplied by i; it 
depends on conventions assumed in the definition of external derivative. : 
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Let us notice that w(vy) = Jy € G, and Vy = const. Therefore, the Lie derivatives 
vanish. However, the last term is a horizontal vector since 


Leal 
[uv, ve] = Luy YH = lim 7 (Rove oa vH), 


that is to say, dw(uy,vy) = 0. 
It remains to compute the term dw(uy, vy). By using the same formula as before, 
one obtains 


du(uy, vy) = —wluy, ev] = —[w(uv),w(vy)] = —[w(u),w(v)]. 0 


By choosing a basis (e,) in the Lie algebra G, the Cartan equations can be 
expressed in coordinates 


1 
2% = dw* + BO Be Aw. 


Of course, the coefficients c%, are structure constants of the Lie group G. By differ- 
entiating, one obtains 


ar = weed Aw — aco A dw”. 
From the last equation one can see that 
Di" (uje,ie)— ase (hu, hy aw) — 0; 


for any u,v,w. This formula is called Bianchi identities; it can be written in the 
following compact form: DQ = 0. 


5. Geometry of the Frame Bundle 


In applications we are often interested in the geometry of a base manifold and in- 
dividual fibres over its points are treated as an “internal structure” of these points. 
The bundle manifold is, in such an approach, a kind of a “phase space” of internal 
states of base points. Such a “phase space of internal states” turns out to be a useful 
concept in investigating the geometric structure of the base manifold. This is the goal 
of the present section.. 

Let P(X) = (P, X,7,G) be a principal bundle, © a connection form in it, and f 
a smooth (C’) local cross-section of the fibre bundle, i.e., a cross -section of a region 


a™1(U) of the bundle, 


no f =1id. 
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5.1. Definition. A connection 1-form on U with values in G is defined as the 
pull back of the connection form in -'(U) C P by the mapping f, i.e., 


w, = (f*d)2(u) = wyay(f'u), 


where « € U, u € T,(X). 

It can be proved that if a 1-form w on U C X is given, with values in G, and f is 
a smooth cross-section of the bundle 7~!(U), then there exists the unique connection 
© in r71(U) such that w = f*&. The proof consists in, constructing a 1-form at 
p € f(U) with the required properties, and extending it to the entire region 7~*(U). 
Such a form at a point p € f(U) is given by 


@p(v) = we(m'v) + Vo, 


where v € 7,(P), p = f(z), is a vector such that v = v; + v2 and, from the definition, 
v, = (f'o7n’)v, ie., v; is a horizontal vector and, consequently, v2 is a vertical vector 
(x'vo = 0). The vector 32 € G is obtained from the vector v2 with the help of 
the canonical isomorphism between the Lie algebra of the structural group and the 
vertical spaces of the fibre bundle (see sec. 3). Extension of the form @, to all points 
of 77!(U) is made by using the action of the structural group G on x~1(U). (See 
Choquet—Bruhat et al. 1982, pp. 362-363). 

The definition of a local connection on the base manifold X (i.e., of a connection 
on U Cc X) essentially depends on a local cross-section f of the bundle (i.e., on a 
cross-section of t~'(U)). Among many such cross-sections one is privileged; let us 
denote it by s; and call it a cross-section canonically associated with the trivialization 
(U;, p;) of the fibre bundle. Let us define the mapping id;: U; x G by z ++ (z,e), where 
e is the neutral element of the group G; then, from the definition, s; = yj’ oid;. The 
following diagram will help to grasp the meaning of this definition: 


EE 


id, 
U; 


5.2. Definition. The connection form w; = s?@ will be said to be a connection 
form (or simply a connection) on a local trivialization (U;,y;). It plays an important 
role in physical applications. 

With the help of a little bit more tedious proof (see Chcquet-Bruhat et al. 1982, 
pp. 364-365) one can show that if (U;,y;) and (U;,y;) are two local trivializations 
of the fibre bundle P(X), then two connections w; and w; on these trivializations, 
corresponding to the same connection @ in P, i.e., w; = s7@ and w; = $)@ at a point 
xz € U;NU;, are related by the following formula, 


Wie = Ad(7;\(z))wje SF (¥;Omc)z, 
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where 7;; are transition functions (see sec. 1), and Oyc is a Maurer-Cartan 1-form 
of the structural group G (see sec. 3, footnote 5). 

A local curvature on a base manifold X is defined with the help of a similar 
procedure as a local connection on X. 
_ 5.3. Definition. Let (Uj, y;) be a local trivialization of a fibre bundle P(X) and 
9 a curvature form on 77 '(U) C P. A 2-form of curvature Q; on a local trivialization 
(U;, p;) is defined by 

2; = s*2. 


Two curvature forms (2; and (2, on two local trivializations (U;,y;) and (U;,y;) 
respectively corresponding to the same curvature form {2 on P at x € U; NU; are 
interrelated with the help of the following transformation, 


2;(u,v) = Ad(¥;(2)*)2;(u, v), 


As it is directly seen, the Cartan equations in a local trivialization (U;,y;) have 
the following form, 
QQ; = dw; + [w;, wi]. 


6. Applications to Gauge Theories 


The number of applications of the fibre bundle geometry to physics quickly increases. 
The most spectacular, and perhaps the most important, of these applications were 
mentioned in the introduction to the present chapter. Applications to the theories of 
space-time, gravity, and cosmology will be studied in the following chapters. As a 
corollary to this chapter, let us enumerate (after Trautman 1980) some intersecting 
areas of the theory presented in the above sections and a geometrical formulation of 
gauge theories. 

In the gauge theories one considers fibre bundles (both principal and associated) 
over space-time that plays the role of a base manifold. To construct a full gauge 
theory requires the selection of (1) a gauge group, (2) a particle type corresponding 
to the given gauge field, (3) a form of the field equations. The structure group 
of the fibre bundle is identified with the gauge group. In the Yang-Mills theories, 
connection forms on local trivializations (U,,y;) of the principal bundle are what is 
called by physicists gauge potentials, curvature forms $2; on (U;,y;) — gauge fields, 
and cross-sections of associated vector bundles — matter fields. 

To be more precise, let us consider a principal fibre bundle P(M) = (P, M,7,G) 
over space-time M and a vector bundle €(M) = (€,M,7,G,F) associated with 
P(M) by a representation p:G — IL(F,F). A Higgs field of the type (p, F) 1s defined 
to be a cross-section ~:M > N — €. When reading the literature one should 
pay attention to the fact that various authors accept different definitions; namely, 
some authors define a particle of the type (p,F) as a mapping w:P — F satisfying 
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the condition #(vg) = p(g7!)#(v) for every p € P and g € G, and the Higgs field 
W as the pull back of this mapping by the cross-section s; of the principal fibre 
bundle, i.e, W = stb: M — F (fig. 3.3) (see Trautman 1980, pp. 296-297). The 
relationship between these two definitions of Higgs fields in a local map (U;,¢;) is 
given by = biz oW (definition of ¢;,. in chapt. 3, sec. 1). Particle and Higgs fields 
are said to be matter fields. 


Identify 


Fig. 3.3. Higgs field (one should remember that p:G — IL(F, F)). 


Another important notion is strictly connected with matter fields, namely the 
notion of the spontaneous symmetry breaking. Let us consider a particle of the type 
(p, F) : . 

yp P—WCF, 


where the subspace W is an orbit of the Group G in Ff, 1.e., for any two points wo, 
w € W there is an element g € G such that w = p(g)wo. Let further H be an isotropy 
group (called also a stability group) of the element wo, i.e., 


H= {9 €G: p(g)wo = wo}. 


Let us define 7 
Q={peEP: W(p) = wo}. 


In such a case the fibre bundle Q(M) with H as its structural group is a reduction 
of the fibre bundle P(M) having the structural group G (see sec. 1). This reduction 
can be regarded as a geometric counterpart of the breaking of symmetries described 
by the group G to symmetries described by its subgroup H. 

It can be shown that the connection form we (corresponding to the structural 
group G), after the reduction of the fibre bundle P to the fibre bundle Q, defines the 
connection form wy (corresponding to the subgroup H) if and only if wy assumes its 
values in the Lie algebra H of the group H which in turn is equivalent to Dy) = 0 
(see Trautman 1980, pp. 297-298; 1981, pp. 409-413). 


Let g be a metric of space-time M (the Minkowski metric is used most often), 
and kj; = k(e;,e;) be a non-singular metric in the Lie algebra G of the group G (ki; 1s 
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invariant in the representation Ad), where (e;) is a basis in G. The form ki * QANQ), 
invariant” with respect to the action of the group G, can serve as a the Lagrangian of 
a gauge theory. In such a case sourceless field equations assume the form: D* 2 = 0. 


Bibliographical Note. Classical expositions on the theory of fibre bundles are Steenrod 
(1951) and Husemoller (1975); one could also use Sulanke and Wintgen (1977). The basic information 
can be found in every modern textbook of differential geometry, e.g. Kobayashi and Nomizu (1963) 
or Bishop and Crittenden (1964). 

The first gauge theory comes from Wey] (1918); in this form he presented the electromagnetic 
theory within the framework of its unification with gravitation theory. Only twenty years later, the 
forgotten Weyl’s idea was used by Schwinger (1951, 1953, 1962) in the theory of quantum fields. 
Great successes of gauge theories begin from the work by Yang and Mills (1954) in which the former 
ideas were applied to the theory of strong interactions. In the original version of the Yang-Mills 
theory, bosons transferring strong interactions turned out to be massless particles. This was the 
reason of the initial lack of interest in this theory. However, the crisis was efficiently overcome 
by Higgs (1964a, b, 1966) who proposed a mechanism of mass generation, called to-day a Higgs 
mechanism. It consists in a spontaneous symmetry breaking in interactions between the vacuum 
and the Higgs field. After generalizing, by Utiyama (1956), the Yang-Mills theory to other gauge 
symmetries, a dramatic expansion of these theories took place, crowned with the unification of 
electromagnetic and weak interactions by Weinberg (1967) and Salam (1968). 

It was Lubkin (1963) who first noticed a connection between gauge theories and geometry 
of fibre bundles. The gauge theories were afterwards systematically developed in the language 
of fibre bundles by Trautman (1978, 1979, 1980); see also Dreschsler and Mayer (1977), Daniel 
and Viallet (1980), Eguchi et al. (1980), Choquet-Bruhat et al. 1982, pp. 401~408, Nash and Sen 
(1983, chapter 7), Trautman (1984). A popular paper by Berstein and Phillips (1984) presenting 
applications of fibre bundle methods to physics is recommended. 


7Let us recall that the so-called star operator defines the isomorphism 
#: AP(X") —+ AM-P)(X”) 
by 
prep, 


where the form +, dual with respect to the p-form @, is a form such that r(a|@) = a A *@ for every 
form a € A(X"), where 7 is a volume element 7 = O'A---AO", and (-|-) denotes an inner product 
on A? (see Choquet-Bruhat et al. 1982, pp. 294-295). 
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Chapter 4 


The Frame Bundle and General Relativity 


0. Introduction 


One of the main reasons for going from special to general relativity was for Einstein 
the idea to extend the equivalence of all inertial reference frames to all frames of 
reference. As is well-known, such an equivalence, in general, can be achieved only 
locally. If a basis at a given point of space-time is to be considered as the mathe- 
matical counterpart of a local reference frame, then it is no wonder that the bundle 
of bases (frame bundle) over a space-time manifold provides a suitable mathematical 
environment to formulate the general theory of relativity. 

The frame bundle is a principal fibre bundle but, being a “concrete” bundle, it 
has a richer structure than “abstract” bundles, which were studied in the preceding 
chapter and which are used to geometrize gauge theories (of the Yang-Mills type). 
The richness of the frame bundle structure and its specific properties are strictly 
connected with the existence of the soldering form @ in this bundle. This form — as 
its name picturesquely suggests —- solders the frame bundle with its base manifold 
(space-time) which reflects the fact that the structure of the bundle is, in principle, 
determined by the structure of the base manifold. From the point of view of general 
relativity it is a natural property since, in this theory, gravitational field is “reduced” 
to the space-time geometry. 

A fibre 7~'(x) of the frame bundle over a point z of space-time forms a set of 
all bases at the point z, 1.e., the set of all possible frames of references at rz. One 
could say that in the case of the frame bundle over space-time, an “internal space” of 
an event z is equipped with a structure which enables one to describe all phenomena 
occurring in the neighbourhood of z, with respect to any frame of reference with its 
origin at x. In this sense, the idea of relativity is incorporated into the rich concept 
of the frame bundle over space-time. 

It turns out, however, that the structure of the internal space of events of the 
type “any possible frame at x” is too general (or too vague). As we know from 
chapter 2, physics can develop on a manifold only if this manifold carries a Lorentz 
metric. On the other hand, the appearance of a Lorentz metric on the base manifold 
reduces the frame bundle to the subbundle of orthonormal frames, or, in other words, 
it. corresponds to a spontaneous breaking of symmetries (described by the GL(R*) 
group) in the frame bundle into symmetries of the Lorentz group. The very possi- 
bility of physics enforces such a spontaneous symmetry breaking. This is reflected in 
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the structure of internal spaces in such a way that now this structure admits only 
pseudoorthogonal local frames. Experimental results tell us that physics distinguishes 
exactly this realization of the principle of relativity. 

In the present chapter the following topics are discussed: geometry of the frame 
bundle (sec. 2 and 3), the reduction of the frame bundle to the subbundle of or- 
thonormal frames (sec. 4). All these are only a preparation for sec. 5, in which the 
mathematical structure of general relativity is discussed; we will also focus on the 
problem of how the frame bundle structure leads to the correct choice of connection. 

In Comments and Remarks (sec. 6) some of possible generalizations of general 
relativity (the so-called connection—metric theories) are mentioned. This time, the 
Bibliographical Note at the end of the chapter constitutes its important part; it urges 
the reader not to be satisfied with a rather compact presentation of general relativity 
given here but to reach for at least some of available textbooks and monographs. 


1. Geometry of the Frame Bundle (continuation) 


The subject matter of the present and following sections is the bundle of frames 
F(X") = (F, X", 7, GL(R")) (see chapt. 3, example 2.2). Let us notice that a frame 
at a point z € X” can be also defined! as a non-singular linear mapping p,: R” > 
T,(X") defined by pr(u,...,Un) = u € 7,(X"); the numbers p,(uj,...,Un) are 
components of the vector u in the frame p;. It can be proved that every fibre bundle 
of the form P(X") = (P, X",7,GL(R”)) is isomorphic? with the frame of bundles if 
an only if one can define a soldering form in the bundle P(X"). Let us define it and 
explore some of its properties. 
1.1. Definition. Let the mapping 


$,: T,(X") — R", 


where p = (z,pr) € F(X") is a frame (basis) pz (with the basis vectors (e,)) at 
xz € X", be given by . 

$,(u)=6'(u), ue 7-(X"), 
where 6' is a dual basis with respect to (e,;). In other words, the mapping ®, coordi- 


nates to every u € 7,(X") its components in the frame p,. If one prefers to define a 
frame as pz: R” — T,(X"), then & = pz’. A one-form 


Ae aR” 


defined by 


1 Provided that a base in R” has been defined. 
2In the category of principal fibre bundles. 
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is called a soldering form (or a canonical form) on F(X"). One speaks also of the 
soldering of the manifold X” with the bundle of frames. (Since in the present section 
we are interested principally only in forms defined on frame bundle, we shall not 
distinguish them from those on the base manifold by the tilde.) 

The soldering form can be expressed in coordinates 


(9p(v))” = (a )"gdax"(x'v). 


Indeed, if 0, = 0/Ox%, (pr)a = a'x0g, 0 = 0%eq, where (€q) is the natural basis in 
R”, then 
O,(v) = ®,(n'v) = &,(da"(n'v) dp) 
= 6,(dx*(n'v)(a~*)"gay9s) 
= dx®(n'v)(a~") ey. 
The soldering form 6 on F(X”) is a tensorial form of the type (id, R%) (see 
chapt. 3, definition 4.1). Indeed, from its definition @ is a horizontal form. Moreover, 


if g € GL(R"), id(g) = g, v € T,(F(X")), then 


(IR59)p(v) = 6,(R,v) = ®,(m'UR,v)) 
= ® pop(7'v) = g',(zx'v) = g”*6,(v). 


The one-before-last equality follows from the fact that if p = (z, p,) H IR,p = 
(x, p,) then pz > p, = 9" pr. 

1.2. Definition. A connection in the bundle of frames is called a linear con- 
nection. The 2-form O = D6@ is said to be a torsion form of the linear connection in 
Fae). 

Now, we shall prove the following. 

1.3. Theorem. The torsion form can be expressed with the help of the following 
equation, called the Cartan equation or the structure equation (or the Cartan structure 
equation) for the torsion, 


O(u,») = d0(u, v) + [o(u), 0(0)}, 

where [w(u), A(v)] = w(u)0(v) — w(v)0(u), and w(u)@(v) denotes the action of w(u), 

which is an element of the Lie algebra GL(R”) of the group G(R"), on 0(v) € R®. = 
Proof. 


O(u,v) = DO(u,v) = dO(uy, vx) 
= d6(u, v) — d0(u, vy) — d0(uy, v) + dO(uy, vv), 
dO(uy,v) = Ly, 8(v) — O([uy, v]), 


since L,,@(uy) = 0 as @ is a horizontal form, but 


O({uy, v}) a tuy a9 = [Luystel9. 
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We have used the notation 7,@ = a(v), where a@ is a 1-form and v a contravariant 
vector (2, can be thought of as an internal product operation), and the following 
equality, 

eee tw]o = Uy eu] 
which can be easily checked (by direct computation in coordinates). By joining to- 
gether the two expressions above after straightforward manipulation, one obtains 


aay oy= LO) 1, 004 1, L,,8 = 1,L.,0. 
Let us note that 
dg(t) 
dt 


(this follows from the definition of connection as a form with its values in the Lie 
algebra of the structural group). By using the definition of Lie derivative and remem- 
bering that JR, acts ”vertically”, we arrive at 


= w(u) 


tO) 


tla 
ivy = ip (Ry), = iv (90)|_ = ~w(u)0(0), 


iC 


that is to say, 
dO(uy,v) = —w(u)O(v). 


A similar procedure leads to 
dO(u, vy) = w(u)A(v). 


The term d6(uy, vy) vanishes since 6 is a horizontal form. 

By collecting together all the above results we obtain the Cartan structure 
equation. o 

There exists a simple relationship between torsion and curvature forms. By 
taking external covariant derivative of the torsion form, we find 


De =) \0. 


The easiest. way to do this is to express the second term of the right-hand side of 
the Cartan equation as an external product. Let us recall that the external product 
of a p-form a and a q-form £ is defined to be 


(aA B)(vr,.-- 5 Upta) pl pq! ao sign 7) 


x ce coy Up) B(Up41, ON) Uta) 


where 7 denotes permutation of indices. One should note that sometimes, in place 
of 1/(p!q!), the factor 1/(p + q)! is used in the above formula; in such a case anti- 
symmetrization makes the factor 1/2 to appear. 
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2. Geometry of the Base Manifold 


First we will present a method to construct forms in a base manifold X” (with values 
in the space of tensor fields of a given type), canonically associated with tensorial 
forms (of the type (p, R®’)) in the bundle of frames F(X"). Afterwards, by using this 
method we shall construct forms on X”, canonically associated with the soldering form 
6, the torsion form O, and the connection form © in F(X). In this way, the geometry 
of the base manifold will be determined in terms of the frame bundle over X”. 

Now, we will generalize to the tensorial case the mapping ®,, defined at the 
beginning of the preceding section. Let us consider a manifold X” and let $,, p = 
(x, pr),  € X", map any tensor on X” into its coordinates with respect to the frame 
Pry 1.€., 

O,: @° Tz (X") 8 T.(X") — R", 
where g = s +t. Let us also consider v; € J,(F(X")) and u; € T,(X") such that 
tit (pion 

2.1. Definition. An r-form a is said to be canonically associated with an 


r-form & defined on F(X") if 
s(t. 0c) O>*(a(v1, sae Ur) 


where x = 7(p), and @ is a tensorial form of the type (p,R™) with values in R™. 
We shall show that this definition is independent of the choice of v; and of the 
choice of the frame pz. 
Indeed, (1) it does not depend on the choice of v;. Let v; and v} be any vectors 
such that uj = m’(p)v; = m'(p)vi; then 


B"(G(vi)) — BF" (&(v;)) = $5 *(G(v; — vj)) = 0, 


since @ is a tensorial form (and consequently horizontal) and the difference v; — v} is 
a vertical vector. 

(2) The definition does not depend on the choice of the frame p,. Let p = (z, pz) 
and p’ = (x,); in such a case there exists g € G(R”) such that p’ = IRgp. Let 
further vf = IR, vi- Since IR, acts vertically, we have n’v! = mv; = u;. Therefore, we 
obtain 


Gp'(vj) = (IRG&)p(vi) = p(g~")&p(vs) 


and $5! = > p(g) on the strength of the definition of $,. From the last two formulae 
it follows the independence of the definition of the choice of p. 0 

According to the above recipe the soldering 1-form @ on F(X”) defines a 1-form 
6 on X™. Since 6 is a tensorial form of the type (id, R”), 6 assumes its values in the 
space of contravariant vector fields. From the definitions of 8 and @ we have 


0,(u) = G516(v) = B5'O,2'v = w'v = u, 
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and consequently 6 defines the tensor field 6% = 65. 

Let us notice that the connection form in the principal bundle, being a non- 
tensorial form (it is not horizontal), does not define any form on the base manifold in 
the way discussed above. Asa local connection form on the base manifold X” we must 
employ the form w; = s*® where s; is the cross-section canonically associated with 
the trivialization (U;,~;) (see chapt. 3, sec. 5). Having in mind future applications, 
let us compute 


wi(r'v) = stH(x'v) = &(sir'v) = w(v), 
v € I F(X"). 
Remembering that the form w; assumes its values in the Lie algebra of the structural 
group of the bundle, we can also write w(m'v) = w%(m'v)E*%,, where (E4,) is a basis 
in GL(R”) and w%(7'v) is formally treated as a numerical matrix. The connection 
form §2 on F(X") is a tensorial 2-form of the type (Ad,R”). Therefore, it defines 


the 2-form 2 on X” ; 
2,(x'v4, 702) = @>*(2Q(v1, V2)). 


From the type of the form 2 one sees that Q assumes its values in the space 
of l-covariant and 1-contravariant tensor fields. By using the structure equation for 
curvature, we compute 


Q(m'v,, w'v2) = dw(n’v1, 1’v2) + [w(r'v1), w(2'v2)], 
or in coordinates 
2D = dw%s(x'v4, 1'v2) + w%(m'v1) ws (v2) [EX a, EY 
= dw%(n'v,, m'v2) + (w% A wg) (a’v1, 12) E% a. 
In the last step we have used the equality [€%,€°,] = 68&°, — 68€%,. And finally 
2% = dwg + w%, Aw. 


It is the very well-known form from the standard expositions of differential 
geometry, (Cartan) structural equation for the curvature form on the (base) manifold. 

The torsion form © on F(X”) is a tensorial form of the type (id, R”), therefore 
it defines the 2-form O on X” 


O,(x'v1, 7'v2) = 65 "(O(v1, v2)), 


which assumes its values in the space of contravariant tensor fields. From the structure 
equation for the torsion form we obtain 


O° (n'v,, v2) = dO%(x'v1, 7'v2) + w%(0'v1)0" ('v2) 


— w%g(1'v2)09(m'v1), 
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where (0°) should be treated as a basis in 7,"(X”). This equation can be put into a 
more condensed form 


@% = d6* +. w% + OF. 


If is the well-known structural equation for the torsion form on the (base) manifold. 


3. Geometry of the Base Manifold (continuation) 


In order to make the reader’s work easier, we now collect the more important formulae 
of the differential geometry on the base manifold (in coordinates), without proof. A 
detailed discussion (with proofs) can be found in standard text-books on modern dif- 
ferential geometry (e.g., Choquet-Bruhat et al. 1982, chapt. 5; O’Neill 1983, chapt. 3; 
Bishop and Goldberg 1980, chapt. 5; Kobayashi and Nomizu 1963, passim) or in the 
more extensive presentations of general relativity (e.g., Misner et al. 1973, part. III; 
Weinberg 1972, chapt. 6). All tensors and forms appearing in the present section are 
defined on the base manifold. 

The structure equations for torsion and curvature, correspondingly, can be writ- 
ten in the form 


O' =d'+wi Ag = sTiu6" AG, (3.1) 


; ‘ . lyee 
7, = du; +u7,, Aw = 5 ini AO, (3.2) 


where T,, and R?,; are torsion and curvature tensors respectively. Let (e;) be a 
moving frame and (6) a basis dual to (e;). One has [e,, e:] = ches; cy are called 
structure coefficients of the moving frame (e;). Connection coefficients 77,, are defined 
by the formula Ve; = 7,:6* ® e;, where V denotes covariant derivative. The torsion 
and curvature tensors in local coordinates assume the following forms, 


Ti = T (8, ex, €1:) = Yt — Vik — Cat, (3.3) 


Ria = R(e;, 6, ex, €1) = ex (4s) — ere) + em Vi — Vim Ves — Cini (3.4) 


In the natural basis (0/0z'), for which [0/0z',0/Oz/] = 0, formulae (3.3) 
and (3.4) are reduced to 


T=lu—P in, (3.5) 
Rig = OP — 0,0 gg + Pn Ps — Pin Pe, (3.6) 


where "4; are connection coefficients in the natural basis, called Christoffel symbols. 
The following formulae are valid in any local coordinates, 


>) Btn = OVS — TT a), (3.7) 


(jl) (jl) 
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yes ei i T,; Ria (3.8) 
(sk!) (gk!) 
The summation should be done after cyclic permutation (jli) of the three indices. 
Formulae (3.8) are called Bianchi identities. 


4. The Fibre Bundle of Orthonormal Frames 


4.1. Definition. Let us consider the following mapping, 
F: Py(Xi) = (Pi, X1, 71,91) —+ Po(X2) = (Po, X2, 2, Ga); 


F is called a bundle homomorphism if the bundles P\(X,) and P2(X2) are equivalent 
(see chapt. 3, definition 1.4) and if there is a homomorphism ¢:G,; — G2 such that 
FURg, Pi) = Roig.) F (pi), pi € Pi, 91 € G. A bundle homomorphism will be denoted 
by (F, 4). 

4.2. Theorem. If between P;(X1) and P2(X2) there is a bundle homomorphism, 
then a connection on P;(X,) uniquely defines the connection on P2(X2). m™ 

Proof. Let us assume that H,, C 7,,(P1) defines a connection in P\(X;) (see 
definition 3.1b, chapter 3). Then H,, = IR,,(F'(pi)), Hp, C Tp,(P2) defines a connec- 
tion in P2(X2). From the properties of IR, and of homomorphism ¢ it follows that the 
definition of H, does not depend on the choice of p, and pa. (See Choquet-Bruhat 
et al. 1982, p. 380). a 

Let P(X) = (P, X,7,G) be a reduction of the frame bundle F(X) (see chapt. 3, 
sec. 1). The inclusions P(X) C F(X) and G C GL(R") define a bundle homomor- 
phism. From this and from theorem 4.2 it follows that a connection in P(X) uniquely 
defines the (linear) connection in F(X). 

Let X” be a Riemann manifold. A basis (e,) on X” is said to be an orthonormal 
frame if (e;|e,) = 6,,. Having an orthonormal frame (e,) on X”, we can obtain from 
it any other orthonormal frame (e,) with the help of the transformation é, = a/ye,, 
where a, is an element of the group O(n) of orthogonal matrices n x n. There is, 
therefore, a natural bijection between the set of orthonormal frames on .X" and the 
group O(n). 

Let us denote the set of pairs (z,7,), where g € X™” and 7, is an ortho- 
normal frame at zx, by O(X"). The bundle of orthonormal frames O(X") = 
(O(X"), X", 71, O(n)) can be defined in a natural way; evidently, it 1s a reduction 
of the principal fibre bundle of (linear) frames P(X") = (P, X",7,GL(R")). On the 
strength of theorem 4.2 a connection in O(X™) uniquely determines a connection on 
P(X") which is called the metric connection. Since an orthonormal basis, when par- 
allelly transported with the help of the metric connection, remains an orthonormal 
frame, the covariant derivative of the metric tensor on X" vanishes. Direct compu- 
tation from (3.1) shows that the torsion of the metric connection is also equal to 


60 4. The Fibre Bundle of Orthonormal Frames 


zero. By using elementary tools of the Riemann geometry one can prove that, in any 
Riemann space, there is only one connection with these properties; it is also called 
the Riemann connection. 

Let (F, 4) be a reduction of the bundle F(X) with the structural group G to a 
bundle P(X”) with a structural group G, C G. The opposite question arises: does a 
connection in F(X") uniquely determine a connection in P(X”)? In general, since 
the mapping F need not be a surjection, the answer is negative. However, if X” is a 
Lorentz (pseudoriemannian) manifold and (F,¢) reduction of F(X”) to the bundle 
of orthonormal frames O,(X”), where k is an index of the manifold X” (see chapt. 2, 
sec. 1 and 3), then the above question chooses one and only one connection in O;(X") 
such that the torsion of this connection vanishes. To be more precise we formulate 
the following. 

4.3. Definition. Let F:P; — P, ¢:G, — G be inclusions defining a reduction 
of a principal fibre bundle P(X) with a structural group G to a fibre bundle P(X) 
with a structural group G,. One says that a connection w in P(X) can be reduced to 
P(X) if there exists a connection w; in Pi(X) such that w, = F*w. 

If a connection w in P(X) can be reduced to P;(X), then values of the connection 
form w, belong to a Lie subalgebra G, of the Lie algebra G of the structural group G, 
and the connection in P(X) uniquely determines a connection in P(X) (see Sulanke 
and Wintgen 1977, p. 171). As we have mentioned, in general this is not true. 
However, we have 

4.4. Theorem. There exists one and only one linear connection w in the frame 
bundle #(X”) which can be reduced to the bundle of pseudoorthonormal frames 
O;(X"), being the reduction of F(X") to the pseudoorthogonal group O(n, k), such 
that the torsion @ of w is equal to zero. = 

Proof consists in, first, showing that the linear connection in F(X”) can be 
reduced to the bundle O,(X”) if and only if Dg = 0, where g is a Lorentz metric 
(with index k on X”). Then, by using the relationship Dg = 0, one constructs 
the unique connection in O,(X"). If turns out that the coefficients corresponding to 
this connection can be transformed to the well-known form of the Christoffel symbols. 
The detailed proof would require several additional theorems with their corresponding 
proofs which, in turn, would mean reproducing a large part of a differential geometry 
textbook. The interested reader should be referred, for instance, to the book by 
Sulanke and Wintgen (1972, chapt. 2, sec. 11). 0 

Let us stress that theorem 4.4 distinguishes exactly one connection in O,(X”%); 
it will be called a Lorentz connection (the same name will also be used for the corre- 
sponding connection in X"). Let us notice that, on the strength of theorem 4.2, the 
Lorentz connection in O,(X") uniquely determines the linear connection in He: 

In the light of the above considerations it can be said that the existence of a 
Lorentz structure in X” breaks down the full G(R”) symmetry of the frame bundle 
to the symmetry O(n,k) which will be also called the Lorentz symmetry.? In this 


3It can be proved that the Lorentz structure (with index k) on a manifold X” is equivalent to 
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sense, one can think that the Lorentz structure (or Lorentz metric) behaves similarly 
as a Higgs field in gauge theories (see chapt. 3, sec. 6).4 


5. General Relativity 


The mathematical theory developed in chapters 3 and 4 essentially allows one to 
enrich the metric model of the space-time presented in chapt. 2 (especially sec. 2). 
We have seen how the important role of enriching the manifold structure is played 
by connection. From a logical point of view, one should determine directly which 
connection one has in mind when speaking of the pair (M,g), and consequently the 
pair (M,g), as a model of space-time, should be replaced by the triple (M,w,g), 
where w is a connection in the manifold M. Every choice of connection leads to a 
different theory of space-time (see below sec. 6, B). The ultimate criterion of the 
choice of a physical theory is the agreement of theoretical predictions deduced from 
this theory with empirical data. However, the history of science demonstrates that 
this criterion usually follows the criterion of mathematical elegance. Very often, a 
mathematical formalism of a theory distinguishes, in some way (sometimes uniquely), 
the structure which leads to a good agreement with experimental results.® 

Let us note that the analyses of the preceding section clearly suggest a correct 
choice of connection for the space-time model. If the new theory is to be formulated 
within the framework of the “frame bundle over space-time” scheme, and if it is 
expected to have special relativity as its limiting case, the pseudoorthogonal group 
O(4, 1), called also the Lorentz group £(4), ought to be its structural group. However, 
according to theorem 4.4, there exist one and only one linear connection in the frame 
bundle which is reduced to the bundle of orthonormal frames, namely the Lorentz 
connection for which Dg = 0 and © = 0. Since such a choice of connection is uniquely 
determined by the metric g, one can continue to denote the space-time model by the 
pair (M,g). (One should note that, according to theorem 4.4, a connection in the 
bundle of orthonormal frames O,(M) uniquely determines the linear connection in 
the frame bundle #(M), but the reduction of the connection in F(M) to O,(M) 
uniquely determines the space-time model.) 

It turns out that the above criterion leads to a physical theory which remains in 
very good agreement with experimental data. The choice of the Lorentz connection is 
characteristic of general relativity, and, as is well known, this theory has successfully 
passed all empirical tests. 


the reduction of the frame bundle F(X") to the bundle of orthonormal frames O;,(.") (see Sulanke 
and Wintgen, 1972, chapt. 2, sec. 11). 
4There are more analogies between Lorentz metrics and Higgs fields (see Trautman 1981). 
Collins (1977) calls it a “subliminal role of mathematics” in contradistinction to its “sublime 
role”. The latter consists in that “mathematics in physics acts as a language and a logical framework 
for discussion”. Collins notices a strong increase of the “subliminal role” of mathematics in the 


contemporary physics of gravity. 
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The only degree of freedom, which is left after the above analysis, is the choice of 
field equations for the new theory. Einstein, motivated by reasons of a philosophical 
character, believed that, on the one hand, geometry of space-time should be deter- 
mined by the mass-energy distribution and, on the other, motions of mass-energy 
should be determined by the space-time geometry. These intuitions — together with 
some physical arguments, such as the existence of special—relativistic and Newtonian 
limits — led Einstein to field equations which will now be, almost axiomatically, 
introduced. 

Coefficients of the Lorentz connection assume the form of Christoffel symbols, 
and components of the curvature tensor of this connection have the following sym- 
metries, 


es = —RoP vy; teaav = —Rgapvs (5 1) 
apie = Eecan, wan Rae = 0. 
The so-called Bianchi identities are also satisfied: 
) Ve ae. 
(yu) 
Contraction of the curvature tensor Ry%,g = Rey is called the Ricci tensor. Its 


subsequent (metric) contraction, g°°Rag = R, gives the so-called scalar of Riemann 
curvature. The Bianchi identities, after their contraction (in 6 and y) and multipli- 
cation by g®7, can be reduced to the form 


1 
V_( Ro, — a) = 0. (5.3) 


Terms in parenthesis are called the Einstein tensor and is denoted by G. By playing 
with indices, one can also write 


il 
Gap = leas ae Iu (5.4) 
It can be demonstrated that the tensor 
1 
Ge qe AGuy = Figy a 9 Iu + Agu, (5.5) 


where A = const, is the most general two-index tensor, constructed out of the metric 
tensor and its first and second derivatives and having the property (5.3) (vanishing 
divergence). Tensor (5.5) is a purely geometric quantity. According to Einstein’s 
idea, it should be connected with a term describing the “mass-energy distribution”. 
A natural candidate seems to be the energy-momentum tensor T,,,, appearing already 
in special relativity. Every particle in space-time carries an energy and momentum or, 
more geometrically, to every non-spacelike curve in space-time at each of its points 
there is a tangent four-vector P“, called the energy-momentum four-vector. For a 
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particle moving with a four-velocity V“ (with respect to a given reference frame), the 
energy-momentum four~-vector can be obtained from the energy-momentum tensor 
with the help of the formula 


PY =THYV". (5.6) 


The local conservation law can be written in the form of the vanishing divergence 
of the energy-momentum tensor, V,7% = 0, which suggests the following form for 
the field equations 

Gar Re ips (5.7) 
or 


Gin AGiy = hi, (5.8) 


where k = 87G/c*, G being the Newtonian constant of gravity, is called Einstein’s 
constant of gravity, and A, the cosmological constant. Both equations (5.7) and (5.8) 
were considered by Einstein as the field equations of his theory; the generaliza- 
tion (5.8) appeared in the cosmological context. In traditional non-cosmological 
applications, effects of the constant A are negligible and one can successfully use 
equations (5.7). Since Einstein the significance of the constant A for cosmology was 
widely discussed. For the sake of generality, and also because of the role played by the 
cosmological constant in contemporary theories of grand unification, we will usually 
consider the field equation in the form (5.8). 

In the “empty” space-time, where 7), = 0, equations (5.7) and (5.8) assume the 
form 


fee = 0, (5.9) 
ed ren (5.10) 


For dimensions n = 2 and n = 3, equation (5.9) implies vanishing of the curvature ten- 
sor; for dimension n = 4, the Ricci tensor has 10 independent components whereas the 
curvature tensor has 20 independent components, and consequently equations (5.9) 
(and of course equations (5.10) as well) have non-trivial solutions. 

The field equations can be also obtained from the last action principle by assum- 
ing a suitable Lagrangian for gravity and source-fields (see, for example, Landau and 
Lifszic 1980). However, one should remember that, irrespectively of any theoretical 
reasons, the only lawful argument on behalf of any field equations is their agreement 
with actual empirical results. So far equations (5.7) (or (5.8)) successfully satisfy this 
criterion. 


6. Comments and Remarks 
A. A geometric space-time structure, indispensable for building up the general theory 


of relativity, has been introduced in terms of fibre bundle of linear frames over space 
time reducible to the bundle of pseudoorthogonal frames with the (homogeneous) 
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Lorentz group as its structural group. This is not the only possible choice. Equally 
natural structure would be the fibre bundle of affine frames A(M) over space-time 
M with the affine group GA(R*) as its structural group, reducible to the Poincaré 
group (see chapt. 2, sec. 3). 

The fibre bundle of affine frames A(M) can be thought of as a Cartesian product 
of the fibre bundle of linear frames L(M) and the tangent bundle T(M), A(M) = 
L(M) x T(M), with the affine group GA(R*) = GL(R*) x R* as its structural group, 
where the group GA(R*) acts in the following way: if e: R* > 7,(M) is a linear frame 
at r € M, and u € T,(M), g € GL(R*), be R*, then 


A(M) 2 (€, u)(9, 5) = (eg, u + e(b)). 


The construction of general relativity in terms of the fibre bundle of affine frames 
does not differ essentially from that in terms of the bundle of linear frames (see 
Trautman 1980, 1981). One could also choose other principal fibre bundles over space- 
time which would equivalently lead to the Einstein theory of gravity (see Trautman 
1976). 

As the analyses of the present chapter have shown, general relativity can be 
treated as a gauge theory with a Lorentz metric on space-time playing a role similar 
to that of Higgs fields, namely, spontaneously breaking symmetries to a subgroup of 
the structural group (see chapter 3, sec. 6). Essential differences between Einstein’s 
theory of gravity and other gauge theories come from the existence of the soldering 
form in the frame bundle. This causes the fact that the geometry of the frame bundle 
is entirely determined by that of the base manifold, namely of space-time. (For more 
detail on general relativity as a gauge theory see Trautman 1978, 1980, 1981, 1984). 

In sec. 5 of the present chapter, the field equations of general relativity were 
introduced directly as certain postulated relationships of tensorial type on space— 
time. By consequently using the theory of frame bundles over space-time, the fields 
equations can be introduced in the following way. 

Let us consider the bundle of orthonormal frames O,(M) over space-time M, 
with the homogeneous Lorentz group £ as a structural group. Let w and 2 be 
connection and curvature forms on ©,(M), correspondingly. Let further g be a 
metric on M, and h a biinvariant metric on the group C (i.e., both right- and left- 
invariant metrics, on £). A metric on O,(M) can be constructed in the following 
way, 


V(X, X) = g(da(X), dr(X)) + h(w(X), w(X)), 

where X is a tangent vector to O;(M). This metric is invariant with respect to 
the action of the group £. Let us assume +24; A 24; to be a Lagrangian for gravity, 
compute the Ricci scalar R of the metric y, and perform the variation of the action 
[{ x volume element on O,(M)] with respect to g and w. We shall obtain the 
vacuum field equations with the cosmological constant together with suitable gauge 
equations, D +*§2 = 0 (see chapt. 3, sec. 6, and more extensively Trautman 1984, in 
particular pp. 105-107). 
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B. Analyses of the present chapter inevitably lead to the question: can we obtain 
a theory of gravity different from general relativity by choosing different connections 
in a principal bundle or by performing a reduction of the frame bundle to a different 
subbundle than that of pseudoorthogonal frames? The answer is positive. The prob- 
lem has been extensively studied by Yasskin (1979). In such theories, called metric 
connection theories of gravity, the metric g, the connection w, and the soldering form 
6 are dynamic variables. Dynamical equations of a given theory are obtained from 
the stationarity of the action S(g,w,0,w), where w is a cross-section of an associated 
bundle which represents matter fields (see, chapt. 3, sec. 6). 

The so-called theories with a Cartan connection, i.e., with a connection con- 
sistent with the metric but having a non-vanishing torsion, constitute an important 
subclass of metric-connection theories of gravity. The Einstein—Cartan theory, be- 
longing to this subclass, was especially thoroughfully investigated. The torsion in this 
theory is interpreted as macroscopic spin of particles. Studies concerning this theory 
go back to Weyl (1919) and Cartan (1922, 1923, 1924, 1925); afterwards the theory 
was developed by Kibble (1961); Sciama (1962); Hehl (1973, 1974); Trautman (1972 
a~c, 1973 a-c, 1975, 1978, 1980, 1981). 


Bibhographical Note. More about the structure of the frame bundle can be found in the 
literature dealing with bundles in general (see bibliographical note to chapter 3), in the works by 
Trautman (1970b, 1976), and especially in the review paper by Dodson (1979). 

The presentation of general relativity in sec. 5 of the present chapter was very concise. The 
reader should consult at least some expositions of the huge literature concerning this beautiful theory. 
I shall quote only some examples. 

The following textbooks are often recommended: Weinberg (1972), Misner et al. (1977), Lan- 
dau and Liphshitz (1980), Rindler (1977), Schutz (1983). To classical textbooks, which always 
deserve attention, belong Weyl (1922), Eddington (1923), Tolman (1934), Robertson and Noonan 

1968). 
Mathematical structure of the theory was elaborated in the following monographs: Petrov 
(1966), Hawking and Ellis (1973), Sachs and Wu (1977), Carmeli (1977), Beem and Ehrlich (1981). 
O'Neill (1983) can serve as a good introduction to the mathematical structure of general relativity. 
No serious work concerning solutions to Einstein’s field equations can be done without consulting 
the fundamental monograph by Kramer et al. (1980). A useful collection of exercises is in Lightman 
et al. (1975). 

Philosophical aspects of general relativity are explored by Tonnelat (1971), Raine and Heller 
(1981), Friedman (1983), and Toretti (1983). 
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Chapter 5 


The Cosmological Problem 


0. Introduction 


Relativistic cosmology, or more precisely its theoretical foundation, is a main topic 
of the present book. The foundations of cosmology are deeply rooted in the con- 
temporary theory of space-time and gravitation. Therefore, the preceding chapters 
constitute a preparation for the main topic, or perhaps, even more than a prepara- 
tion: as I shall try to demonstrate in the present chapter, global aspects of space-time 
and gravitation belong, in a sense, to the field of cosmology. Now, the cosmological 
problem will be addressed explicitly. 

Cosmology aims at investigating the structure of space-time on its largest scale. 
One often speaks of the Universe, having in mind not only the space-time itself, but 
also everything that fills it in, i.e., all physical fields defined on space-time. All these 
concepts are understood more or less in an intuitive way. However, one must admit 
that it is not a rough intuition, but an intuition shaped by long tradition, especially 
by the evolution of concepts, which has acquired a great acceleration in our century. 

There exist two essentially different “directions” of studying cosmology. The 
“down-direction” consists in assuming some a priori postulates concerning the world 
structure at large and trying to deduce the “local physics” from them. The “up- 
direction” accepts the local physics as given and tries to extrapolate from it as far 
as possible to guess at the global structure of space-time. The first way of doing 
cosmology was represented by Milne (1935, 1948), and Bondi and Gold (1948); but 
it was the second approach that has acquired the paradigmatic citizenship in the 
cosmology of the XX" century. The extrapolating cosmology also cannot avoid doing 
some more or less aprioristic assumptions concerning either the Universe itself or 
the nature of cosmology as a science of the Universe. Extrapolation, from its very 
essence, goes beyond local collecting of information, and if it does not explicitly make 
some assumptions supervising this process, this means it has made such assumptions 
implicitly. Unfortunately, the majority of publications in cosmology (not to say all 
with very few exceptions) choose the second possibility. This is an additional motive 
to write a chapter in a book on foundations of cosmology discussing such problems, 
even at the risk of loosing something of the formal consistency of the book. At the 
present state of its development the analysis of the cosmological problem cannot be 
fully mathematical. However, an analysis seems indispensable for a mathematical 
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formalism to be able to correctly fill in with a cosmological content. This justifies a 
different, rather unformal, character of the present chapter. 

In sec. 1, I try to determine what it means that cosmology is a global geom- 
etry of space-time or a non-local physics. Quite unexpectedly, it turns out that 
non -local assumptions (with a natural understanding of “non—local”) are involved in 
every empirical prediction of standard physics. Global character of cosmology can 
be understood in a strong sense as a study of the set of all solutions to cosmological 
equations (Einstein’s field equations), i.e., of the set of all cosmological models (such 
a set is called an ensemble of universes). Understanding of cosmology as a theory of 
such a set is not only a useful methodological trick but it has been implicitly assumed 
in the scientific practice for a long time. These problems are considered in sec. 2. One 
cannot forget that cosmology has the ambition to belong to the family of empirical 
sciences. Consequently, the cosmological problem cannot be responsibly dealt with 
without touching its observational aspects. These aspects certainly deserve a separate 
monograph. One can hope that the works of Ellis’ group (quoted in the present chap- 
ter) will lead to such a monographic study. Therefore, in sec. 3, I only summarize the 
most important results of the Ellis programme. These results strengthen the above 
formulated thesis that the discussion of empirically unverifiable assumptions, which 
enable cosmology to be a science of non-local structure of space-time, must enter the 
core of the cosmological problem. 


1. Cosmology as a Global Geometry of Space-Time 


As we know from the preceding chapters, al] dynamical theories of hitherto physics 
presuppose the manifold structure of space-time. However, the traditional approach 
to physics makes use almost exclusively of local properties of the manifold model. 
What matters is only a “correct behaviour” in the neighbourhood of the point at 
which experiments or observations have been made. In practice, one is interested in 
how “to translate statements about geometrical entities in space-time into statement 
about real numbers” (Friedman 1983, p. 33), and the work is smoothly done by local 
maps which define the manifold structure of space time. On the other hand, the 
very concept of a manifold is a non-local notion, and the assumption that physics 
can develop by using only local methods, i.e., that a “local neighbourhood” could be 
isolated from global influences, turns out to be a strong assumption of a non local 
character. 

From the field-theoretical point of view the situation is the following. Local 
empirical predictions are trustworthy only if unexpected perturbations are excluded 
in the future comprised by the prediction. In order to predict the state of a field at a 
certain instant one must know “initial conditions” at an “earlier” spacelike hypersur- 
face (or at a null cone, see Penrose 1980). The assumption that there are no signals, 
generated by a distant event, which at the next moment could affect the investigated 
system (“no-news” assumption, see Ellis 1983), implies a knowledge about the field 
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state on a distant (towards the past) hypersurface. In this sense, cosmological as- 
sumptions are contained in every empirical prediction (see Heckmann and Schucking 
1959). 

The above considerations justify the following assertion: The Universe possesses 
a certain structure (or it is a structure); this structure can be investigated by using 
either local or non-local (global) methods; the first is a task of physics, the second 
— of cosmology. For the time being, terms “non-local” and “global” are used in- 
terchangeably. In the following, I shall explain the meaning of the above thesis and 
argue on behalf of it. Let us now pause for a while to reflect upon a difference between 
local and global methods. Even in contemporary mathematics, a substantial part of 
which consist of “global analysis”, these terms are fuzzy and non-precise. In Smale’s 
opinion, “global analysis is simply the study of differential equations, both ordinary 
and partial, on manifolds and vector space bundles” (Smale 1980, p. 84). As we can 
see, more or less, such a definition of “globality” has been accepted in the preceding 
chapters. The manifold model of space-time and fibre bundles over space-time were 
for us fundamental research tools from the very beginning.’ 

Modern observational astronomy deals with such distant objects (galaxies, ra- 
diogalaxies, quasars, ...) that no reasonable interpretation of what is observed can 
be made without a theory of the space-time over which received signals are propa- 
gated. This means that some features of the Universe, investigated by contemporary 
astronomy, essentially depend on integration of some differential equations over a 
“large scale” domain of the space-time manifold. In this sense cosmological methods 
permeate the domain of the extragalactical astronomy. Very often this is obscured by 
the fact that astronomers usually employ the simplest cosmological models ascribing 
to the Universe strong symmetries (such as Robertson—Walker symmetries). Such 
symmetries effectively eliminate the necessity to compute troublesome integrals over 
space-time. Of course, this fact substantially simplifies the situation but, on the other 
hand, it “screens out” the role of cosmology in standard astronomical investigations 
(see Ellis and Sciama 1972). 

General relativity suffers a certain dualism. Space-time manifold, equipped with 
different geometric structures, is a “primitive element” of the theory, but on this 
space-time manifold various physical fields are defined. It is true that these physical 
fields are also represented by certain geometric structures on space-time (such as 
scalar or tensor fields); however, they do not appear naturally in the process of the 
development of the theory, but are introduced into it by decree. Einstein’s field 


‘A tradition of global analyses in mathematics goes back to Poincaré. To denote this kind of 
research, Morse in 1934 used the term “macroanalysis”. He noticed that “any problem which is 
nonlinear in character, which involves more than one coordinate system or more than one variable, 
or whose structure is initially defined in the large, is likely to require considerations of topology and 
group theory in order to arrive at its meaning and solution” (quoted after Smale 1980, p. 118). The 
famous Morse theory can serve as an example of non-local methods in mathematics. This theory, 
to a large extent, contributed to developments of the global analysis. (See, Smale’s essay What Is 
Global Analysis? in Smale 1980, pp. 84-89; see also ibid. pp. 117-127). 
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equations (see chapt. 4, sec. 5) relate geometry and physical fields with each other. 
In this way, a non-local character of geometrical methods in cosmology implies a 
non-local character of physical fields (with the exception of gravitational field which 
is already incorporated into geometry). One could simply say that cosmology is a 
non-local physics. 

It is interesting to notice that in such exotic (from our present point of view) 
states at the first fractions of a second after the initial singularity, the very dis- 
tinction between local and non-local ceases to be clear cut. The closer we are to 
the initial singularity, the more strongly will any physical property imply non-local 
consequences.” 

The initial singularity itself turns out to be a non-local property of the model. 
From the geometric point of view singularities are understood as geodesic incom- 
pleteness of space-time. A space-time is geodesically incomplete if there is at least 
one non-spacelike geodesic in it along which the affine parameter cannot assume arbi- 
trarily large values (see Hawking and Ellis 1973). The well-known Penrose-Hawking 
Geroch singularity theorems (see ibid.) show that if a space-time possesses certain 
properties, they cannot be “fitted together” with the requirement of the geodesic 
completeness. Among those properties there are always some of a manifestly non- 
local character, for instance: spatial compactness, non-existence of closed timelike 
curves, existence of a Cauchy surface, etc. It is worthwhile to note that in the initial 
singularity of the standard cosmological model (in the so-called Big Bang) all non- 
spacelike geodesics break down; this fact additionally strengthens a global character 
of the singularity. 

Even in space-times in which not all non-spacelike geodesics break down at a 
singularity, the appearance of a singularity causes the appearance of horizons and 
boundaries which are typically global properties of space-time (see Penrose 1968, 
Hawking and Ellis 1973, Geroch and Horowitz 1979, Tipler et al. 1980). 

There are different. kinds of singularity (see Clarke 1975, Clarke and Schmidt 
1977, Ellis and Schmidt 1977, Tipler et al. 1980), and some of them are “more global 
than others”. A singularity is said to be a curvature singularity if in a given space- 
time there exists at least one non-spacelike curve such that the components of the 
curvature tensor, in a frame parallelly propagated along this curve, diverge as one 
approaches the singularity. A singularity is said to be quase-regular if in such a 
situation components of the curvature tensor, in any such frame, behave correctly. 
Suppose that an observer approaches a quasi regular singularity. He is in no way 
locally warned of the imminent danger. Indeed, on the strength of the Clarke theorem 
(1973), his history (a timelike curve) has a neighbourhood isometric to a one in a 
complete space-time manifold. “This means that the ‘had behaviour’ that causes the 
incompleteness is of a global nature: near a single curve, one does not realize that 
anything is wrong” (Tipler et al. 1980, p. 159). Ellis and Schmidt (1977) have shown 


2This idea is contained in the primeval atom hypothesis of Georges Lemaitre (see Godart and 
Heller 1985). 
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that space-time in a neighbourhood of a quasi-regular singularity can be thought of as 
being glued together out of several copies which are perfectly regular. As we can see, 
in such a situation everything seems to be in order, and the geodesic incompleteness 
has clearly a global character. 


2. Cosmology as a Theory of the Ensemble of Universes 


In mathematics, there is also another understanding of what global could mean. We 
face it when, for example, we are dealing, not with a single differential equation on 
a manifold M, but with a space of all possible equations (of a given type) on a 
manifold M. Steve Smale, in his short essay entitled What is global analysis? , after 
having quoted Peixoto’s theorem asserting that the structurally stable differential 
equations on a compact two-dimensional manifold M form an open and dense set, 
writes: “This theorem is an excellent theorem in global analysis. One sees in two ways 
how it is global. First, the differential equation is defined over a whole manifold, and 
structural stability depends on its behaviour everywhere. Second, the theorem makes 
a conclusion about the space of all differential equations on M” (Smale 1980, p. 88). 
In the preceding section we have explored the first meaning of the globality as applied 
to cosmology, now we shall comment upon its second understanding (which could be 
termed “superglobality”). In general relativity things are much more difficult than 
in any “classical” theory of differential equations. Einstein’s equations themselves 
determine the manifold on which they act, and Smale’s second understanding of what 
global means leads to much more difficult — and, one must admit, philosophically 
much more fascinating — problems. There is no other way; such an understanding 
must be reduced to the study of all admissible solutions to Einstein’s field equations. 
A mere look upon the history of the twentieth century cosmology reveals that it 
has developed, as far as its theoretical aspects are concerned, by constructing newer 
and newer cosmological models, that is to say, by exploring the space of all possible 
solutions of Einstein’s equations. 

In agreement with a commonly accepted custom among relativists, the space 
of all solutions of Einstein’s field equations will be called the ensemble of universes 
or shortly the ensemble. We shall try to look at cosmology as at a theory of the 
ensemble. We will argue that such an approach to cosmology is unavoidable. 

Here we meet a crucial point. It is commonplace, both in scientific and in popular 
literature, to stress that all peculiarities of cosmology, as a science, follow from the 
fact that “the Universe is given to us in one copy”. I know of no research work 
that would go more deeply into this methodological peculiarity of the science of the 
Universe. I will dwell on the problem for a while. 

For the empirical method it seems essential to have many instances of a similar 
type. Laws of physics are formulated in the form of differential equations which 
express a structure built up from relations among quantitative properties common to 
many phenomena. Individual characteristic of every phenomenon are accounted for 
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by choosing suitable initial or boundary conditions which select a particular solution 
from the class of all admissible solutions. The uniqueness of the Universe makes 
it impossible to transfer this strategy directly to cosmology. However, one could 
hardly see how to develop a science otherwise. Cosmology, therefore, has chosen the 
following tactics. One assumes that a structure called the Universe is described by 
a solution of a certain equation (usually, Einstein’s field equations); one attempts to 
find all its possible solutions, and then to identify those initial or boundary conditions 
which correspond to the Universe we actually observe. All other solutions describe 
other possible universes. Owing to this strategy the actual Universe is placed in 
the environment of many universes. Now, one has many “individuals”, and one 
can proceed in the same manner as in other departments of physics. In this sense, as 
Bondi (1960, p. 10) remarks, “we select the important (as opposed to the ‘accidental’) 
features of the actual universe by their relation to the theory chosen rather than by any 
independent criterion”. In the following, the set of all possible solutions to Einstein’s 
field equations will be called the ensemble of universes, or the ensemble, for brevity. 

The richness of all possible solutions of Einstein’s field equations is so great that 
usually we explore only “small” subsets of the set of all possible solutions (for instance, 
spatially homogeneous or homogeneous and isotropic solutions). However, the fact 
that we can explore subsets of the ensemble is essential for cosmology. Recently, one 
can notice quite a pronounced tendency: it seems that cosmologists are more inter- 
ested in, say, how a given property (e.g., the existence of singularities, the appearance 
of horizons, etc.) is distributed within the ensemble rather than in the structure of 
individual world models (of single points of the ensemble). This tendency could be 
explained by methodological reasons. Because of the measurement error inherent in 
every cosmological observation, a solution which pretends to describe the real Uni- 
verse can do it only up to a finite degree of accuracy. If we improve our measurements, 
we must be ready to change to some “nearby” solution which could describe the Uni- 
verse to a better approximation. In this sense, a structural stability of solutions must 
be postulated (see Thom 1977, Arnold 1980). The problem is by no means a trivial 
one. It must be decided which solutions are to be defined as identical (or equivalent), 
and when the distance between two different (non-equivalent) solutions is to be said 
“small”. The decision must be both mathematically and operationally realistic. From 
the mathematical point of view, the space of all solutions must be equipped with a 
metric, or at least. with a suitable topology. From the operational point of view, the 
adopted definitions should correspond to what, and with which accuracy, is measured 
in observational cosmology. 

The stability property is also interesting with respect to other features, not nec- 
essarily directly observable, which we would like to ascribe to the actual Universe (for 
instance with respect to such properties as the existence of singularities or horizons, 
different degrees of causality, and so on). If a property of a certain model is shared 
by “neighbouring” models it is said to be a typical property (or a generic property).° 


3More precisely: a subset of a topological space is said to be a Bair set if it is an intersection 


2 2. Cosmology as a Theory of the Ensemble of Universes 


This presupposes a metric or at least a topology on the ensemble which have to be 
both mathematically workable and leading to physically reasonable results. So far a 
“natural” topology on the ensemble is not known, and to investigate the stability of 
different properties one must use different topologies (see below). Theoretical aspects 
of the structural stability of certain classes of cosmological models were investigated, 
among others, by Collins (1974, 1977), Golda et al. (1983), Szydiowski et al. (1984); 
observational aspects of structural stability still await elaboration (see remarks at the 
end of sec. 3). 


The notion of the ensemble of universes was implicitly used in cosmology for a 
long time. For instance, the Cauchy problem, when considered within the cosmo- 
logical context, leads to this notion almost immediately: the set of all admissible 
Cauchy data (on a certain spacelike hypersurface) may, quite naturally, be identified 
with the set of all universes which can be developed out of these data. Explicitly, the 
concept of the ensemble began to emerge in the work by Dicke (1961) who, for rather 
philosophical reasons, postulated that there exists not only the Universe but a whole 
class of universes with all possible values of physical constants. These ideas found 
a more elaborate form in Carter’s talk during the Symposium of the International 
Astronomical Union in Cracow, 1973. Carter used the concept of the ensemble of 
universes to formulate a doctrine called by him the anthropic principle. He used also 
the name “world-ensemble”. “By this | mean — he wrote — an ensemble of universes 
characterized by all conceivable combinations of initial conditions and fundamental 
constants (the distinction between these concepts, which is not clear-cut, being that 
the former refers essentially to local and the latter to global features)” (Carter 1974). 
According to the author, “the existence of any organism describable as an observer 
will only be possible for certain restricted combinations of the parameters, which dis- 
tinguish within the world-ensemble an exceptional cognizable subset”. Since we are 
certainly “describable as organisms”, the Universe we are living in must belong to 
this distinguished subset.* 


of countably many open and dense subsets of this space. A property is called typical if it is shared 
by all elements of a Bair set. Property of the complement of a Bair set is called non-typical. The 
difference between typical and non-typical properties cannot be reduced to whether a corresponding 
set is dense or not (complements of Bair sets can be dense) or to the measure criterion (in the 
case of the space of a finite dimension, a Bair set can have zero Lebesgue measure). However if, 
between two contradictory properties that can be possessed by a model, one is typical, the other 
must be non-typical, and vice versa. An important notion is that of a strongly typical property. 
A property is strongly typzcal if it is possessed by elements of an open and dense subset of a given 
topological space (some authors, when writing of typical properties, have in fact in mind strongly 
typical ones). Correspondingly, one speaks about strongly non-typical properties. If a model has 
a strongly typical property, it preserves it after being slightly perturbed. Moreover, a property is 
preserved if a perturbed model is perturbed again provided the second perturbation is “smaller” than 
the first one. On the contrary, if a model has a strongly non-typical property, a slight perturbation 
can always be found which liquidates this property. (See Richtmyer 1981, pp. 304-311). 

‘Later on, the anthropic principle gained considerable popularity among cosmologists (see, for in- 
stance, Carr and Rees 1979). The possibility of reintroducing the man back into the post—Copernican 
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In the last decade — independently of any philosophical inspirations — the 
space of solutions of Kinstein’s field equations was a subject of intensive mathematical 
research. Usually, one first considers the space Lor(M) of all Lorentz metrics on a 
manifold M. Serious difficulties are encountered already at this stage. It turns out 
that one can introduce infinitely many topologies on Lor(M), and neither of them is 
natural. To show the nature of the problem let us consider it in a more detailed way. 

The neighbourhood of a metric g € Lor(M) can be understood as a set of 
Lorentz metrics on M, whose derivatives up to an order r only slightly differ from 
the corresponding derivative of the metric g on a compact set U C M. The topology 
defined with the help of such neighbourhoods is called C’—-compact—open topology on 
Lor(M). If one assumes U = M, one obtains the so-called F’-open topology on 
Lor(M). 

It turns out that every C’*!—compact-—open topology is richer than C’-compact- 
open topology. Similarly, every F7*+!-open topology is richer than F’—open topology. 
And every F’—open topology is richer than C’- compact-open topology. 

Moreover, one can define the so-called W" Sobolev topology on Lor(M), by 
replacing the postulate of the closeness of metric derivatives with the postulate of 
the closeness of integrals of squares of these derivatives.® It can be shown that every 
W’*s (compact-open) Sobolev topology is richer than C’-compact-open topology 
which, in turn, is richer than W” Sobolev topology. 

This hierarchy of topologies on Lor(M) can be further enriched in many ways (for 
more details see Hawking 1971; Hawking and Ellis 1973, pp. 198, 252-253; Beem and 
Ehrlich 1981, pp. 28-29). Neither of these topologies is “natural” on Lor(M), and one 
must suitably choose different topologies for different problems. For instance, Sobolev 
topologies turn out to be suitable for discussing the stability of initial conditions 
for Einstein’s equations®, but to define the so-called stable causality’ compact-open 
topologies are preferred. However, when considering the ensemble problem, we are 
not interested in all Lorentz metrics on M, but in all those Lorentz metrics on M 
which are solutions of Einstein’s equations; they constitute the subset € of the space 
Lor(M). The space € usually has the structure of a smooth manifold. Moreover, 
there exists its local representation in the space of four functions of three variables. 
It turns out that Einstein’s equations act in € as a Hamiltonian system. 

One-parameter families of solutions for Einstein’s equations are curves in €, 


universe caused far too much of metaphysical emotions; see a responsible review by Demaret et 
Barbier (1981) and a balanced evaluation by Barrow (1983). However, the main reference to the 
anthropic principle, as far as its both physical and metaphysical aspects are concerned, is a book by 
Barrow and Tipler (1983). 

5These integrals and squares are defined in terms of a certain positively defined metric on M 
which always exists. 

®The problem is how to formalize the postulate that a slight change of the initial conditions leads 
to evolutions only slightly different from each other. 

7Space-time (M, g) is said to be stably causal if a slight perturbation of the metric g does not 
produce closed timelike curves. 
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and spaces of solutions of linearised Einstein’s equations form tangent spaces to € 
(fig. 5.1). Of course, one can meaningfully speak of a tangent space to € only if € has 
the structure of a smooth manifold in the neighbourhood of a given solution. In such 
a region, Einstein’s equations are said to be stable with respect to linearization. This 
is not always the case. One of the most interesting results, as far as investigations of 
the ensemble are concerned, is the theorem stating that Einstein’s equations (with the 
vanishing energy-momentum tensor) in the neighbourhood of a solution go € €, such 
that the space-time (M, go) has a compact Cauchy surface, are stable with respect 
to linearization if and only if the space-time (M, go) admits no Killing vector field 
(see Fisher and Marsden 1979, p. 206). This result points to the fact that space-times 
with symmetries (with Killing vector fields) violate the smooth manifold structure of 
the solution space €. Indeed, it turns out that the space € has a cone singularity in 
the naighbourhood of a solution with symmetries (see Fisher et al. 1980; Brill 1982).° 
This fact has serious consequences in the theory of “small perturbations” of Einstein 
equations (e.g., in the problem of the origin of galaxies and clusters of galaxies). Since 
in the neighbourhood of a solution with symmetries the space € is not stable with 
respect to linearization, small perturbations of such solutions produce, in principle, 
spurious perturbations (unless they satisfy the so-called Taub condition, see Fisher 
et al. 1980). 
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Fig. 5.1. The space of solutions of Einstein’s equations. 


Let these few results suffice as illustration of the subtle nature of the ensemble. 
Our aim is not to review this field of research (for more see Montcrief 1975; Marsden 
1981, chapters 9 and 10, and already quoted works), but only to point out that the 
ensemble, from its very nature, is a natural environment of cosmology. 


“Analogously to the “cone” F(X, Y, Z) = X* + Y? — Z? at the point (0, 0,0). Everywhere 
besides this point, the linearized equation dF/ds|,-9 = 2X X’ +2YY' —2ZZ' = 0 uniquely defines 
the tangent space. 
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It might turn out that the space of all solutions approach to cosmology be in- 
dispensable in formulating quantum theory of gravity and its applications to under- 
standing the evolution of the Universe. For instance, Hawking (1979, 1982) argues 
that a suitable method to quantize gravitational field should consist in adapting to 
this goal Feynman’s path-integral method. The method, as applied to gravitational 
field, consists in specifying a three-metric h; and the values of matter fields y, on a 
spacelike surface S,, and going to a three—metric h and the values of matter fields v2 
on a spacelike surface S2, as a sum over all four-geometries having S, and S> as their 
boundaries, with matter fields suitably matching y, and y2 correspondingly. The 
method was elaborated and applied to cosmology by Hartle and Hawking (1983) and 
by Hawking himself (1984). These authors assumed that spacelike three-geometries 
are compact (the closed universe). Quantum states of such a universe are described 
by a wave function defined as a functional on the compact three-geometries and the 
values of matter fields on them, and satisfying a second order functional differential 
equation, called the Wheeler-De Witt equation. Specifying the states of the universe 
is equivalent to specifying the boundary conditions for this equation. 

By a rotation to Euclidean time r (t +» —ir) in the action, one changes to the 
integration over all positive—definite four-geometries having suitable three-geometries 
as their boundaries (the so-called Euclidean approach, see Hawking 1979). Hartle 
and Hawking put forward a proposal that the ground-state amplitude for a given 
three-geometry is that given by an integral over all compact positive—definite four- 
geometries having this three-geometry as a boundary.® “This means that the Universe 
does not have any boundaries in space or time (at least in the Euclidean regime). 
One can interpret the functional integral over all compact four-geometries bounded 
by a given three-geometry as giving the amplitude for that three-geometry to arise 
from a zero three-geometry, i.e., a single point. In other words, the ground state is 
the amphtude of the Universe to appear from nothing” (Hartle and Hawking 1983, 
p. 2961). 

As it can be seen, the Hartle-Hawking method presupposes the space of all 
possible states of the Universe (of all compact three-geometries). This set is called 
superspace (having nothing in common with the superspace of supergravity theory), 
and in quantum cosmology it plays a similar role to that played by the ensemble in 
classical cosmology. 

Finally, let us note that historical developments of the relativistic cosmology 
can be naturally described in terms of the ensemble. Theoretical investigations in 
cosmology had penetrated larger and larger domains of the ensemble by discovering 
more and more new solutions. Later on theoreticians were interested in how certain 
properties of world models were distributed in the ensemble. It is interesting that 
world models first discovered historically (the static Einstein model, the empty de 
Sitter model ...) or classes of world models (spatially homogeneous but isotropic 


°Matter fields should be regular on these geometries, and only space-times with the cosmological 
constant A > 0 are considered, A = 0 is regarded as a limiting case. 
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models, spatially homogeneous but anisotropic models ...) usually turned out to 
be non-typical in the space of solutions. The direction of research in observational 
cosmology is just the opposite: investigations in this field always tend to narrow a 
certain domain in the solution space, i.e., to distinguishing this class of solutions (or, in 
the highly idealized case, this single solution) which best fits the actual observational 
data. Without its observational image modern cosmology would loose its identity. 
Only if we remember about its observational side can we claim that cosmology is a 
theory of the ensemble. 


3. Observational Problems of Cosmology 


Since Lemaitre’s work of 1927, in which this author for the first time demonstrated 
that actual red shift measurements in galactic spectra do not contradict predictions 
of a relativistic world model, comparisons of theoretical models with observational 
data became an important chapter of cosmology. Hubble’s (1936, 1937) ambition 
was to treat cosmology as an observational science in the following sense: the task 
of astronomical research is to describe the “observable universe”; if one assumes that 
it provides a “fair sample of the Universe”, a simple extrapolation will allow one to 
arrive at a satisfactory cosmological model. However, soon it became evident that 
Hubble’s approach was unrealistic. Modern astronomical (and radioastronomical) 
techniques reach such faraway domains of space-time that it is impossible to inter- 
pret the registered “data” without assuming a space-time model (at least as a working 
hypothesis). In practice, the following procedure, called the empirical testing of cos- 
mological models, has been established. Basing on certain postulates, usually inspired 
by philosophical views (such as cosmological principle, postulate of the spatial closure 
of the Universe, and so on) or by empirical results believed to be well-founded (e.g., 
symmetrical matter distribution in the Universe), a certain cosmological model or a 
class of models is assumed. In the second step, one deduces relationships between 
observable magnitudes (the so-called observables) from this world model or from this 
class of world models, and then one compares theoretically deduced relationships with 
those established observationally. Results of this procedure were initially only able 
to show that there were no contradictions between a given model (or a given class of 
models) and the observational data. 

The first major success of the observational cosmology consisted in making the 
steady-state cosmology (Bondi and Gold 1948, Hoyle 1948) “non-satisfactory” and 
“less attractive” than its rival evolutionary world models. However, this became 
possible, not owing to the traditional method of observational testing, but rather to 
the discovery of the microwave background radiation (Penzias and Wilson 1965, Dicke 
et al. 1965). The steady-state cosmology was not able to explain the genesis of this 
radiation. 

In the seventies the so-called standard cosmological model has been established 
as a result of several factors. Accumulation of new empirical data indeed seemed to 
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corroborate postulates assumed so far on purely theoretical basis (see, for instance, 
Symposium of the International Astronomical Union in 1973 on comparing obser- 
vational data with theoretical models (Longair 1974)). An important role was also 
played by stronger and stronger connections between the evolutionary world model 
and standard physics: high energy physics has found its “natural laboratory” in the 
very early Universe. On the other hand, typically relativistic methods began to infil- 
trate the field of astrophysics. People get gradually accustomed to the fact that the 
Universe is a relativistic object. However, during this entire period the method of 
testing of cosmological models was based on intuitions and many tacit assumptions, 
believed to be self-evident, rather than on a solid mathematical analysis of the prob- 
lem. It is symptomatic that a paper by Sandage (1961) was long treated as a classical 
work in this field in spite of the fact that it dealt with observational possibilities of 
the 200-inch telescope rather than with observational foundations of cosmology. 


A paper by Kristian and Sachs (1966) was the first in which the authors began 
systematically to analyse the problem. They treated it in a general way, not limiting 
themselves to a chosen class of models. They assumed that (1) space-time is a four- 
dimensional pseudo—Riemannian manifold with a structure enabling one to express 
all the investigated observational relationships as a series around the here-now of the 
earthly observer; (2) null geodesics are histories of light beams; (3) the dynamics of the 
world is ruled by Finstein’s field equations with the dust energy-momentum tensor. 
Assumption (1) is the most limiting one. It implies that all investigated relationships 
remain correct for space -time distances which are small when compared with both 
the scale factor of a given world model and the scale of local inhomogeneities. In the 
case of relativistic cosmology, assumption (2) can be deduced from the postulates of 
general relativity as the limit to geometric optics. In the work of Kristian and Sachs, 
assumption (3) interferes only with calculations leading to numerical estimates of 
some observational magnitudes; the authors succeeded in making no use of dynamical 
equations in all other considerations. 


However, cosmology owes systematical analysis of its observational possibilities 
to G.F.R. Ellis and his group. In a series of works (Ellis 1975, 1980, 1984; Ellis and 
Perry 1979; Ellis et al. 1978) the problem has been clearly stated, and some of its 
aspects discussed in a detailed way. An extensive paper of Ellis et al. (1985) marks 
the close of a stage of program implementation. It certainly deserves a closer look. 


In their first work, the authors attempted at decoding information of cosmo- 
logical significance contained in the ideal astronomical observations, 1.e., with the 
assumption that all measurements can be performed with any desired precision and 
without making use of any dynamical equations. The assumption remains in powcr 
that the pair (M,g), where M is a four-dimensional smooth manifold, and g a 
Lorentz metric defined on it, is a model of space-time. As usual, null geodesics are 
histories of photons, and timelike geodesics histories of freely falling particles with 
non-zero rest-mass. The program consists in expressing the metric g with the help 
of the so-called observational coordinates, i.e., such coordinates as can be directly 
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translated into observational terms. If derivatives of the components of the metric 
tensor (in observational coordinates) can also be determined by measurements, then, 
by computing the coordinates of the connection and of the curvature tensor in the 
usual way, one could fully reconstruct the space-time model from astronomical ob- 
servations. In this method one need not expand any observable into a series, and 
consequently one need no assumption about the smallness of the region controlled by 
actual observations. 

The observational coordinates {z'} = {w, y,9,y} are defined with respect to the 
history C of our Galaxy or of our Cluster of Galaxies; one assumes that this history 
is a suitably smooth timelike geodesic (at least in a neighbourhood of the instant in 
which observations are made) (see fig. 5.2). The surfaces w = wo = const are past 
light cones of events situated on the curve C. The parametrization of the curve C 
is normalised in such a way that w measures the proper time along C’. The instant 
w = Wo corresponds to the event q which is our “here and now”. 


L@® 


time 


space 
null geodesic {8,@ = const} 


observed event {w,7,8,9} 


the history of a galaxy 


Fig. 5.2. Observational coordinates. 


Let us denote the light cone of the event g by L~(q). Null geodesics generating 
L~(q) are given by {x? = 0 = const, x* = y = const} on the surface w = const. The 
coordinates 6 and y can be normalised in such a way that they become the usual 
spherical coordinates on the celestial sphere with respect to a non-rotating reference 
frame. 

The coordinate z' = y measures space-time distance along null geodesics. One 
should notice that the concept of a space-time distance contains both a space distance 
to the curve C' and a temporal distance to the event g. There are a few possibilities as 
to how practically identify the coordinate y with the actually measured “distances”. 

It may happen that the coordinates {w,y,6,y~} do not cover the entire space- 
time manifold (even if one does not take into account the coordinate singularity 
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8 = » = 0). However, they do cover the region NM C M which can be observed from 
the curve C' (these coordinates can also give a “one-many” covering, i.e., the same 
coordinates can designate distinct events; in such a case, ghost images of galaxies 
may appear). 

Usually, one assumes that the length of the period during which the humanity 
performs its astronomical observations is negligibly small as compared with the age 
of the Universe, and one can treat it as an event q on the curve C. If — with a 
bit of optimism — one replaces the event gq by a finite proper time interval Z, the 
observational situation is slightly improved (fig. 5.3). 


(m0, y,9,@) 


(w,7,9,9) 


Fig. 5.3. Observational situation with the assumption of a finite period 
of performing observations. 


The general conclusion from the detailed analysis — which, unfortunately, can- 
not be carried out here — is rather pessimistic (as it should be expected): ideal 
astronomical observations are not enough to uniquely reconstruct the geometry of 
space-time on our past light cone L~(q). It turns out, in particular, that we are 
not able to observationally determine (with no help of dynamical equations) how 
the three-dimensional hypersurface forming our past light cone is embedded in the 
four-dimensional space-time manifold. Moreover, it is impossible to demonstrate 
observationally that space-time is spherically symmetric around us; neither the op- 
posite statement can be demonstrated. These pessimistic conclusions remain in force 
even if one assumes that our observations cover a finite span of time. Therefore, 
observational data collected from L~(q) leave a considerable cosmological indeter- 
minacy: without help from dynamical equations (such as Einstein’s field equations) 
many different world models can still show consistency with observational data. 

How the situation is improved when one takes into account dynainical equations? 
One should be aware of the fact that after such equations have been accepted, obser- 
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vational data become theory—-impregnated to a much higher degree than before; and 
one cannot hope that one would still be able to discern in a cosmological model what 
comes from observation and what is theoretically introduced into it. The problem 
was addressed in the second part of the work by Ellis et al. (1985). Hinstein’s field 
equations have been assumed as dynamical equations in cosmology; therefore, in this 
work, observational implications of the relativistic cosmology are investigated. No 
space-time symmetries are a priori assumed. On the contrary, the aim of the work is 
to demonstrate the existence or non-existence of space-time symmetries (or of other 
properties of a model) from observations helped by the theory of general relativity. 

The observational coordinates are also used in this approach, but they have to 
be adapted to the field equations (afterwards they were analysed with the help of the 
Newman-Penrose formalism). The problem consists in determining (in terms of ideal 
astronomical observations) the Cauchy data for the field equations on the past light 
cone L~(q), and then in integrating these equations on and outside the past light 
cone L~(q). However, one must formulate the final value problem, characteristic for 
cosmology: one integrates the equations from the Cauchy data on L~(q) backward to 
a certain value y = y*. This limitation for the value of y is a consequence of the fact 
that the very early Universe was not transparent to electromagnetic radiation. The 
backward integration on the light cone L~(q) can meet obstacles in a caustic form 
which may have origin either in the topology of space-time or in the phenomenon of 
gravitational lensing. This, of course, creates serious difficulties for the program of 
observational cosmology (see Friedrich and Steward 1982). 

By using the field equations, the coordinate y can be (uniquely) replaced by the 
red shift z. To the limiting value y* now corresponds the value z*. 

The Cauchy data, consisting of the most detailed cosmological information we 
could hope to obtain, form the mazimal data set; let it be denoted by D(wo, z). 
It consists in the following ideal observations: red shifts, proper motions, distances 
to galaxies (the so-called area distances)!°, distortion of images of celestial objects 
and their groups (provided that “proper shapes” are known), results of galaxy and 
quasar counts up to a limiting value z = z*. It is worthwhile to notice that the 
background radiation is not an element of D(wo,z*). This radiation does not carry 
any information about particular regions of space-time, but rather imposes certain 
global constraints on the space-time structure which have to be satisfied by every 
cosmological model purporting to describe the real Universe. 

Here an interesting problem arises, namely the problem of the observational 
space-time. A space-time (M, q), being a solution of Einstein’s field equations, is said 
to be observational space-time associated with D(wo, z*) if it contains a hypersurface 
&' such that 4 U {q} is the past light cone L~(q) of an event q situated on a regular 
timelike curve C, and if all observational data as predicted by the model (M, q) 


This distance is determined by observing the pencil of null geodesics emanating from an obser- 
vation event q. If this pencil forms a solid angle d{2 at q and spans a space element dS at a certain 
point p along the pencil, then the area distance from q to p is dS = r7dQ. 
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are exactly those data which are elements of D(wo,z*). If there exist more than 
one observational space-time associated with a given D(wo,2*), one is met with an 
observational indeterminacy of the model: there are more than one cosmological 
models that are consistent with observational data contained in D(wo, 2*). 

Actual achievements of the Ellis program consist in demonstrating that the max- 
imal data set D(wo, z*) is simultaneously the smallest set of observational data re- 
quired to uniquely determine the geometry of the past light cone L~(q) backward to 
z =z". In other words, the knowledge of the ideal data D(wo, z*) is the necessary and 
sufficient condition for uniquely determining, with the help of Einstein’s equations, 
the geometry on our past light cone (backward to z*). Doubtlessly, it is an interesting 
result. One cannot see any reason why it should be a priori true. One could easily 
imagine the situation in which a model of the Universe would be either undetermined 
or overdetermined by the set of ideal observational data available in principle (the first 
possibility occurs where the field equations are not taken into account; see above). 


Geometric structure of the past light cone L~(q) is only a part of a cosmolog- 
ical model. Now, one should extend the solution obtained on the light cone to the 
Cauchy domain of dependence inside the cone. It is a difficult mathematical problem 
which can be successfully solved if relationships between mathematical quantities are 
analytic. However, there are some reasons suggesting that this assumption is not 
necessary. Since the analycity assumption is very restrictive, it would be desirable to 
have a proof without it. 

It is also possible to treat D(wo, z*) as the initial data and look for a solution 
to the future, rather than backwards. This is a still more difficult problem. From 
the hyperbolic character of Einstein’s equations it follows that one cannot expect a 
“deterministic” solution starting from a set of initial data, unless one assumes that 
the solution is analytic or that it cannot be perturbed by anything which could not 
be contained in the initial conditions (see sec. 1). The fact that, for a cosmologist, 
to reconstruct the past is more important than to forecast the future is hardly a 
consolation. 

The question of stability of observational space-times is an important problem 
which still awaits its full elaboration. Roughly speaking, an observational space-time 
(Mo, go) associated with the maximal data set Do(wo, 2”) is said to be observationally 
stable if a small perturbation D(wo, z*) of the set Do(wo, z*) leads to observational 
space-times (M,;,9;), associated with D(wo, 2”) such that (Mz, 9,) only slightly differ 
from (Mo, go). To define precisely the meaning of “slightly differs from” would require 
introducing a topology on the space of solutions to Einstein’s equations. The observa- 
tional stability condition should be satisfied by realistic world models. The situation 
in which slightly different observational data would lead to drastically different world 
models would be disastrous for any research in cosmology. 

Although the analysis of the observational situation in cosmology with the ideal 
observations replaced by the real ones is still to be carried out, it is not difficult to 
foresee its general result. If the precise knowledge of the ideal data D(wo, 2") is neces- 
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sary and sufficient to reconstruct the geometry of our past light cone, and since there 
is no doubt that the postulate to be realistic will restrict the data (to the set R(wo, 2*), 
say), one can expect a significant observational indeterminacy of cosmological models. 
Analogously to the concept of observational space-times associated with D(wo, z*), 
one can introduce the concept of realistic space-times associated with R(wo,z*). On 
the strength of the above analysis it is clear that the set of all realistic space-times 
associated with R(wo, 2*) consists of more than one space-time. This statement can 
be thought of as a formulation of the observational indeterminacy principle in cos- 
mology. It turns out to be an unavoidable consequence of the extrapolation method 
used in cosmological research. 

Observational indeterminacy in cosmology can be either acknowledged as a nec- 
essary element of our world picture, or neutralized with the help of different philosoph- 
ical postulates such as the cosmological principle claiming that the earthly observer 
occupies a typical position in the Universe (see Ellis 1975; Heller 1978). If one chooses 
the second possibility, one has to define cosmology not only as a science about the 
Universe, but also as a science about the assumptions which must be made in order 
to render such a science possible. If we take into account the fact that even the 
observational Ellis’ program is based on many (both explicit and tacit) assumptions 
(laws of physics are everywhere the same, space-time is modelled by the pair (M, 4g), 
etc.), it seems that the above definition of cosmology can hardly be avoided. 


83 


Chapter 6 


Geometry of the Standard Cosmological Model 


0. Introduction 


In the preceding chapter we have discussed some fundamental problems of cosmol- 
ogy understood as a non local physics. The discussion gave us a kind of program or 
suggested a style of research in cosmology. Unfortunately, the scope of the present 
book does not allow us to implement this challenge in a more ambitious way. Espe- 
cially, to deal with cosmology as a theory of the ensemble of universes would require 
a separate monograph. Moreover, it would seem more reasonable to postpone writing 
such a monograph till the domain is worked out more fully. Nevertheless, the present 
book cannot be complete without at least touching the space of solutions of Einstein’s 
equations. No wonder that in such a situation our choice goes to a subset (of the zero 
measure!) of this space, namely to the Robertson-Walker solutions. These solutions 
are distinguished: first, by the maximal degree of symmetry admitting the existence 
of the “open” global time; second, by the surprising property that there are exactly 
these simplest, in a sense, world models that correctly — albeit in an approximate 
way — describe the observed Universe. Because of the last property one sometimes 
speaks of the standard cosmological model. 

The present chapter is intended as an example of doing cosmology according 
to the program outlined in the preceding chapter (inasmuch as a zero -measure set 
can serve as an example of anything!). Therefore, from the very beginning we turn 
to global mathematical methods. They allow us to compactify the material and 
pay off with precise definitions of such concepts as particle with a non-zero rest— 
mass, photon, observer, etc. which have to be put into the formalism in order to 
change mathematics into cosmology. The books by Sachs and Wu (1977) and by 
O’Neill (1983) were both good examples and a great help. However, in the final 
parts of the chapter, concerning observational aspects of cosmology, we postpone this 
formalistic approach. It turns out to be too narrow for embracing the observational 
practice. To treat a galaxy or a cluster of galaxies as a curve in space time can be 
very precise from the mathematical point of view, but — if taken literally — does 
not allow one even meaningfully to speak about the angular diameter of a celestial 
object. Cosmology, in a very transparent way, reveals both the weakness and the 
strength of the empirical method: The weakness —- since the reality, as it manifests 
itself in empirical results, turns out to be too rich to be translated into the available 
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mathematical structures; and the strength — since simple mathematical structures 
surprisingly well approximate the reality. . 

The organisation of the present chapter is the following. In sec. 1, the concept of 
warped (Cartesian) product is introduced; it turns out to be a good tool significantly 
simplifying subsequent analyses. In sec. 2, the aforementioned definitions of particles 
and observers are presented, and some propositions connected with them are proved; 
they will serve us afterwards in shortening computations. The construction of the 
standard cosmological model (of the Robertson—Walker space-time) is carried out in 
sec. 3. The most important features of the evolution of the standard world model 
(singularities, spatial curvature, etc.) are expressed in the three theorems. The goal 
of sec. 4 is to formulate and to prove them. In cosmological formulations two sub- 
classes of the Robertson—Walker world models play an important role, namely dust— 
and radiation-filled universes. Solutions describing these models are presented in 
sec. 5. Observational aspects of the standard world model are (very) briefly discussed 
in sec. 6. Many comments and remarks can be made as far as the standard model is 
concerned; in sec. 7, we focus on the problem of idealisations that are incorporated 
into the construction of this model; we briefly mention solutions with the cosmological 
constant different from zero and world models with non-standard topologies (which 
are locally Robertson—Walker space-times); we touch upon a difficult problem con- 
cerning causally disconnected regions of space-time and the two attempts to solve it: 
the older one, known under the name of the mixmaster mechanism, and the newer 
one, the so-called inflationary scenario. 

The present chapter is followed by a Bibliographical Essay; its aim is to introduce 
the reader, who might feel disappointed with the brevity of the chapter dealing with 
the standard universe, to a fascinating realm of books on cosmology and related 
topics. 


1. Warped Products 


Let us consider a Cartesian product B x F of two pseudo-Riemannian manifolds 
(B, 9p) and (F, gr), where gg and gp are metric tensors on B and F, correspondingly. 
™*(gp) + o*(gr), where 7:B x F > B, o:B x F — F are projections, is a metric 
tensor on B x F. Let f > 0 be a smooth function on B. 

1.1. Definition. Cartesian product B x F, equipped with the metric tensor 


9 =" (98) + (f o7)’o"*(gr), 


is called warped product and denoted by M = B x ;F. 

If f = 1, the warped product is the usual Cartesian product. As it can be easily 
seen, warped product is a special case of a fibre bundle. Therefore, it is reasonable to 
call B a base manifold (or base, for short), and t~!(p) = p x F, p € B, a fibre over p 
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(fig. 6.1). Fibres and cross-sections o~'(q) = B x q, q € F, are pseudo-Riemannian 
manifolds. f is said to be a warping function. 

1.2. Example. All surfaces of revolution are warped products. Let M be a 
surface obtained by the revolution of a flat curve C around an axis in R°. The 
distance from the curve C to this axis is given by the function f:C > R!. One has 


N= Gur 8%, 


Fig. 6.1. Warped product M = B x yF. 


1.3. Definitions. Let B x F be a Cartesian product of manifolds B and F, 
and o the projections as above, and f a function on B. The function f = fom defined 
on B x F is said to be the lift of the function f to the product B x F. Let p € B, 
q€ Ff, and v € 7,(B). The unique vector 0 € 7,,.)(B x F), such that dx(#) = v, is 
said to be the lift of the vector v to (p,q) € B x F. 

In an analogous manner, one can lift functions and vectors from the manifold F 
to B x F. The above definitions remain valid also with respect to warped products. 

1.4. Definition. Let (M,gy) and (N,gn) be pseudo Riemann manifolds. A 
diffeomorphism 7: M — N such that ~*(gn) = cgm, where c = const # 0, is called 
a homotetic mapping, or a homotety for brevity (it is a special case of a conformal 
mapping which is defined by the condition: c = h is a different from zero function on 
M). |c|? is called a scale factor. 

It is easy to prove the following. 

1.5. Proposition. Let M = B x sF be a warped product, 

(a) for every q € F; the mapping 7|(gx,) is an isometry on B; 
(b) for every p € B, the mapping o|(p,) is a homotety with the scale factor 1/f(p); 
(c) for every (p,q) € M, the fibre 7~'(p) and the cross-section o~'(q) are orthogonal 

at (p,q). 9 

The properties expressed in this proposition allow one easily to reconstruct 
the geometry of the warped product 6B x ,;F when the geometry is known of 
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the manifolds B and F (see O’Neill 1983, pp. 204-211; Beem and Ehrlich 1981, 
pp. 55-79). 
The concept of warped product was introduced by Bishop and O’Neill(1969). 


2. Macroscopic Description of Particles and Observers 


Let us begin with definitions the aim of which is to make the description more precise 
of an “averaged matter content” of the Universe traditionally used in cosmology. In 
what follows, all structures are defined on the space-time manifold (M, g). 

2.1. Definition. A timelike future-oriented curve a:] — M, such that 
|a’(r)| = 1 for all r € J, is said to be an observer in space-time M; T is his proper 
time. 

2.2. Definition. <A timelike future-directed curve a:! — M, such that 
g(a’, a’) = (a’, a’) = —m’, is said to be a particle with the rest-mass m, or a particle 
for brevity. 

2.3. Definition. If the curve a of definition 2.1 or 2.2 is a geodesic curve, the 
observer or the particle is called freely falling. 

2.4. Definition. A future-directed zero geodesic 7:1 — M is said to be a 
photon or a light beam. 

Images Im(a), Im(y) C M of the curves a: — M and 7:1 — M are one- 
dimensional manifolds; they are called histories (or world lines) of observers or par- 
ticles. 

2.5. Definitions. Timelike future-directed unit vector v € T,(M) is said to be 
an instantaneous observer at q € M. By performing the orthogonal decomposition 
of the tangent space T,(M) = Ru+u" , one obtains the time azis Ru of the observer 
and its rest-space ut. 

One could imagine that an instantaneous observer is situated in 7,(M) and 
observes the Universe with the help of the exponential mapping exp,:T7,(M) — M. 
If u € 7,(M), then there exists a unique geodesic curve a such that a/(0) = u; 
the exponential mapping is defined by exp,(u) = a(1). By the phrase “to observe 
the space-time M from the tangent space 7J,(M) with the help of the exponential 
mapping” one should understand projecting structures from an open neighbourhood 
U Cc M of q by exp;' and treating them as structures on 7,(M) (see Sachs and Wu 
1977a, sec. 4.2). 

2.6. Definition. A unit vector field, the integral curves of which are observers 
(of definition 2.1), is said to be a reference frame in the space-time (M,g). Observers 
in question will be called observers in a given reference frame. 

Satisfactory agreement with actual observational data is achieved under a sur- 
prisingly strong simplification allowing one to treat the “material content of the Uni- 
verse” as a perfect fluid. In physics, perfect fluid is defined as a continuous medium 
moving, at each point q € M, with a velocity v, such that an observer (in an every- 
day meaning of this term), moving with the same velocity Vg, sees this medium as 
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isotropically distributed in space. Till the end of the present section, all analyses will 
be carried out locally, i.e., in a neighbourhood of a point q at which the perfect fluid 
approximation remains valid. The above ideas are formalized in the following. 

2.7. Definition. A perfect fluid in space-time (M,q) is the triple (U, p,p), 
where U is a reference frame, called the co-moving reference frame; p:M — (0, oo) is 
the so-called energy density function, and p: M — [0, 00) is the so-called pressure (or 
stress) function (it is assumed that p and p are of C® class). The energy-momentum 
tensor of the perfect fluid is defined to be 


T = (p+ p)U* @U* + pg, (2.1) 


where U* is a one-form metrically equivalent to the vector field U. 
2.8. Proposition. The components of the energy-momentum tensor of the 
perfect fluid (in the co-moving reference frame) are 


Too = p, T;; = p63, (t,7 =1,2,3). o 


Proof. Let U be the co-moving frame. We shall use the basis (U, X,Y) where 
Peele nen 2 (00 = pei, x) = 0; 7 (xX) = (X,Y). 0 

If p = 0, the perfect fluid is called dust; if p = 4p, the perfect fluid is called 
photon gas or radiation field. 

The local conservation laws requires div T = 0. 

2.9. Proposition. Let (U, p,p) be a perfect fluid. Div T = 0 implies 


(1) Up = —(p + p)divU; 
(2) — (grad p)* = (p+ p)VuU. m= 


Proof. Let *T be a tensor field of the type (2,0) metrically equivalent to the 
tensor field T. One has (divT = 0) => (div*T = 0), and one obtains 


div*T = U(p+p)U+(p+p)VuU + (p+ p)(divU)U + grad p'=0. 


The vertical component of this equation ((div*7T,U) = 0) gives equation (1), 
and its longitudinal component ((div*7, VuU) = 0) equation (2). 0 

Equation (1) expresses the rate of change of the energy density as measured 
by an observer in the co-moving reference frame U. If in equation (2) the space 
component of the gradient of the pressure function ((—grad p)*) is treated as a force, 
the coefficient (p + p) as a counterpart of mass, and VyrU as an acceleration, then 
equation (2) turns out to be Newton’s second law. 

The presented here formalised macroscopic description of particles and observers 
was originally proposed by Sachs and Wu (1977a, b), and its somewhat simplified form 
can be found in O’Neill (1983) (see also Heller 1982). 
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In a geometric approach to physics adopted also in the present book — in contradis- 
tinction to the purely mathematical texts — definitions are more important than 
theorems. Very often definitions express fundamental facts or experimental gener- 
alizations, whereas theorems are usually only their formal consequences. Through 
definitions the reality enters mathematics (see Sachs and Wu 1977a, sec. 1.5). The 
following definition constitutes both a mathematical stylization and a synthesis of 
many observational facts lying at the basis of the standard cosmological model (these 
facts will be briefly presented in sec. 6 of the present chapter). 

3.1. Definition. Let S be a connected three—dimensional Riemann manifold 
with constant curvature k = —1,0, or +1; J be an open interval of the space Rj, 
where R} denotes the space R’ with the signature 1; R(t) > 0, t € J be a smooth 
real function on J. The warped product M = I x RS is called a Robertson—Walker 
space-time. 

A Robertson—Walker space-time is, therefore, a manifold J x S with the line 
element 

ds? = —dt? + R?(t)do? (3.1) 
where do® is a line element on the manifold S lifted to J x S. Fibres S(t), t € J, 
are called instantaneous spaces or space sections (at an instant t) of the Robertson-— 
Walker space-time. Every instantaneous space S(t) has a constant curvature equal 
to k/R?(t). The warping function R(t), in this context, will be called the scaling 
function, and its value R(to), for an instant to € I, the scale factor. 

The line element a? can be reduced to the one of the following two forms, 


do? — —(a2')’ + (de*)* + (dz*)’ 
{1+ § [Cott + (#2)? + (@°]}° 

_ dr? + r7d6? + r? sin? Ody? 

So ea 


(3.2a) 


(3.26) 


Let d/dt be a vector field on I C R’, and U = 0/0t its lifting to 1x S. The curve 
Ixq,q€S, can be parametrized in the following way: a,(t) = (t,q). Of course, it is 
an integral curve of the field U, and consequently an observer in the reference frame U 
(definition 2.6); such an observer will be called a fundamental observer. Thus, the 
function ¢ ascribes to every fundamental observer his proper time; this function will 
be called the cosmic time function, or time function, for brevity. 

It can be easily checked that (U,U) = g(U,U) = —1, and U L S(t), for every t € 
I (see proposition 1.5c), i.e., every instantaneous space S(t) is a spacelike hypersurface 
in the space-time M = I] x RS. 

Let us introduce two further concepts playing important roles in cosmology. 

3.2. Definition. A space-time M = I x pS is said to be spatially isotropic with 
respect to a point q € S(t) if the point (t,q) € M has a neighbourhood N such that, 
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for any two unit vectors v and w tangent to S(t) at (t,q), there exists an isometry 
¢:N — N such that ¢ = id x |s and dd(v) = w. 

3.3. Definition. A space-time M = I x pS is said to be spatially homogeneous 
if, for p,q € S(t), t € I, there exists an isometry w:U — O, where U and O are 
neighbourhoods of p and q respectively. 

Less strictly but quite commonly, one simply says that the space S(t) is isotropic 
around a point g € S(t) or homogeneous. 

The concept of spatial isotropy formalises the intuitive statement that there are 
no privileged directions with respect to a point g. The concept of spatial homogeneity 
formalizes the intuitive statement that there are no privileged points in the considered 
space. These concepts can be more generally defined in terms of the Lie group theory 
(see Wolf 1967, Robertson and Noonan 1968). It is known from the Schur theorem 
that a space which is isotropic with respect to each of its points is also homogeneous 
and that a space is isotropic with respect to each of its points if and only if it is a 
space of constant curvature (see Wolf 1967, p. 57). 

From these considerations we obtain the following. 

3.4. Corollary. Instantaneous spaces of a Robertson—Walker space-time are 
isotropic around every of its points, and consequently they are also homogeneous. a 

This corollary plays an important role in observational motivation of the the 
choice of a Robertson~Walker space-time for the standard cosmological model. 

3.5. Proposition. For any Robertson—Walker space-time M = I x RS, in the 
reference frame U = 0/0t, non-vanishing components of the Ricci tensor are 


ms 
Roo = 35, 

R? k R 

Bi = Rar = Ras = —255 + 255 + R 


(notice the change of the Ricci tensor symbol as compared with chapter 4) and the 


curvature scalar is ; 
| k R 
an(et etal | 


Proof by direct computation from the definition; one can also employ the theo- 
rem on the Riemann and Ricci tensors in warped products (see O’Neill 1983, pp. 209- 
21). a 

Einstein’s equations introduce dynamics into the cosmological model. In the 
following, for the sake of simplicity, we will use Einstein’s equations in the form (5.7) 
of chapter 4, i.e., without the cosmological constant. This simplification is motivated 
by observational data which point out that cosmological constant effects at the present 
epoch are negligible within limits of observational accuracy (see Petrosian 1974). In 
the following, we shall use the geometric system of units in which the light velocity c 
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and the Newtonian constant of gravity G are assumed to be equal to one; in such a 
case the constant « appearing in equation (5.7) of chapter 4 has the value 87. 

3.6. Theorem. Perfect fluid (U, p, p) in the Robertson—Walker space-time M = 
I x RS is described by the following equations 


an OR? Ok 
3° Re Ee Re’ (3.3) 
ee a: 
Proof by direct computation from Einstein’s equations R,, — Fel = 8n J 


proposition 3.5 can be used. By comparing equations (3.3) and (3.4) with for- 
mula (2.1) one can see that these equations describe a perfect fluid. 0 

Definition 3.1, together with theorem 3.6 constitutes the essence of the con- 
struction of the standard cosmological model. 

3.7. Corollary. 


—4r(p+ 3p) = 35. oD (3.5) 


3.8. Corollary. Equation (1) of proposition 2.9 in a Robertson—Walker space— 
time assumes the form 


Up=p= -3(p +P). (3.6) 


(Equation (2) of the same proposition is trivially satisfied since in the considered case 


(grad p)+ = VyU =0.) a 


4. Evolution of the Standard Cosmological Model. 


The structure of the cosmological model presented in the preceding section is rich 
enough to contain in itself information concerning the world evolution and its limita- 
tions. This information is deciphered in the following theorems. 

4.1. Theorem. (O’Neill 1983, p. 348). Let M = I x pS be a Robertson—Walker 

space-time satisfying the condition: p+3p > 0 (the strong energy condition). If for a 
certain tp € I the relationship Hp = H(to) = Ro/ Ro > 0 is satisfied, then the interval 
I has the initial point ti, such that to - Hy’ < tin < to, and either (1) R > 0 or (2) 
R attains maximum at tmax, tmax > to. If (2) takes place, then I = (tin, tan). 
_ Proof. From the condition p + 3p > 0 and relationship (3.5) it follows that 
R < 0, 1.e., that the diagram of the function R(t) is a convex curve. In particular, 
this diagram is everywhere (besides the tangency point) beneath the diagram of the 
function 


Q(t) = R(to) + HoR(to)(t — to), 
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which is the tangent line to R(t) at the point R(to). The condition Ho > 0 implies 
vanishing of the function Q(t) at to — Hg’ (see, fig. 6.2). Since, however, the diagram 
of the function R(t) is a convex curve and R(t) cannot assume negative values, R(t) 
must break down at a certain instant ti, situated between to — iy and tg . 

From the fact that R(t) <0 on the entire /, it follows that either (1) R > 0 on 
the entire J or (2) R attains maximum for a certain tmax > to , and afterwards R<0. 
In the last case, the reasoning analogous to that in the first part of the proof leads to 
the conclusion that J must have the final point tg, > to. 0 


-1 
to -Hp fo tel 


Fig. 6.2. Breaking down of the history of the Robertson—Walker world model 
at the instant tin; to — Hes < tin < to 


Note that theorem 4.1 says nothing about the state of the Universe in the process 
of its approaching backward in time to the initial instant tj,; in particular, it does 
not say that p — oo and R ~ 0 whent — ti. 

4.2. Theorem. Let M x RS be a Robertson—Walker space-time with Hp > 0 
for a certain tp € J. If p > 0, -i <a<p/p,a=const, and R — 0 as t > ty, then 
R- co and p> co when t — tin. | 

Proof. From the assumption —} <a<p/p it follows that p+ 3p > ep > 0 for 
a certain € > 0. Therefore, theorem 4.1 applies; in particular R<0. 

Since in the interval (tin, to) one has R > 0, and on the strength of the above 
condition p — ep > —3p, equation (3.4) changes into the inequality 


R 
p = —(2 + €)Pm 
which, as can be easily seen, is equivalent to 
| d 
wees Rete = 0, 
melt es 
j.e., in the interval (tin,to) one has pR?** > p(to)R(to)*t*. However, from the as- 


sumption one knows that R — 0 if t — tin. Therefore, pR? — co and p — oo as 
t— tae 
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Equation (3.3) can be given the form 


8a 


“ae pR? = R? +k, (4.1) 
which immediately leads to the conclusion that also Ro owast— ty. O 

Therefore, the evolution of the standard cosmological model begins with the 
singularity, the so-called initial singularity. It is characterized by the fact that as 
time, going backwards, approaches the initial instant, t — ti,, both the energy density 
p and the expansion rate R grow indefinitely. This fact justifies the name Big Bang. 

4.3. Theorem. Let M =I x pS be a Robertson-Walker space-time satisfying 
the conditions of theorems 4.1 and 4.2. (A) If R > 0 on the entire interval I = (tin, 00), 
then either k = 0 or k = —1. (B) If R attains maximum at tmax € J, then k = +1 
(and I = (tin, tin)), and if R > 0, as t — ten, then p — oo and R -» —oo, ast — tfn@ 

Proof. (A) First, let us assume that R > 0 on the entire interval I = (tin, 00). 
As in the preceding theorem, p > 0 and p+3p > 0, which implies p+p > 0. Therefore, 
equation (3.6) leads to the conclusion that p < 0, i.e., p decreases indefinitely. Two 
cases ought to be considered: 

(1) R — o as t > oo. From the proof of theorem 4.2 it is known that 
£ (pR?**) < 0 (but now on the entire [) which implies that pR?** is a bounded 
function for large t € J. In other words, pR*? — 0 as t > oo. From equation (4.1) 
one gets either k = 0 or k = —1. 

(2) However, the case R — b = const as t — oo cannot be excluded. With this 
assumption, and remembering that R <0, one obtains that R > 0. This, together 
with equation (4.1) leads to the conclusion that either k = 0 or k = +1. However, 
the case k = +1 is excluded. Indeed, let us assume, on the contrary, that k = +1. 
From equation (4.1) one gets pR? — 3/(8), and consequently p > 6 for a certain 
5 > 0. Since R — 0 as t + oo, there exists a sequence {t;} > oo, t; € I, such that 
{R(t;)} > 0. Therefore, equation (3.5) gives {(p + 3p)(t;)} — 0, but this contradicts 
the inequality p + 3p > ep > 0 (see, beginning of the proof of theorem 4.2). 

Therefore, if R > 0 on the entire J, then either k = 0 or k = —1. (It can be 
shown that the case R — b as t — oo also excludes k = 0; see O’Neill 1983, p. 349.) 

(B) Let us assume that R attains maximum at tmax € J, i.e., Rites) — 
Equation (3.3) gives p(tmax) = 3k/87R?(tmax) > 0; hence k = +1. The rest of the 
proof is similar to that of theorem 4.2. o 

4.4. Corollary. Let us define the so-called critical density fa = Sefer 
Po = Perit, then k = +1 (index zero denotes the value of the corresponding parameter 
at a certain instant to € /, for example at the instant of performing observation). = 

Proof. From equation (3.3) one gets po — pait = 3k/87R2. 0 
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5. Dust Filled and Radiation Filled Universes 


Two special cases of perfect fluid Robertson- Walker space-times play an important 
role in cosmological applications, namely dust filled and radiation filled world models. 
We will consider them in turn. 

5.1. Proposition. The scaling function R(t) of a Robertson—Walker space-time 
M =I x pS filled with dust (U, p, p = 0) satisfies the so-called Friedman equation 


-» 8 & 
Ye Sp TR k, (5.1) 
where EF = pR° > 0 is a conserved magnitude on the entire interval J. = 

Proof. Equation (5.1) is equivalent to equation (3.3). Equation (3.6) with p = 0, 
after simple transformations, gives d(pR*)/dt =0. a 

Solutions of the Friedman equation describe the evolution of the cosmological 
model. In the considered case, these solutions — for different values of the parameter 
k — are the following, 


ki=.0: R= Cet?/5, C= bre: 
i a t = $n E(0 — sin 8); 

R= $xE(0 — cos8), (0 <0 < 2x); 
k=-1: t = 4nE(sinh ny — ); 


R= 4nE(coshn — 1) (n > 0). 


These solutions are shown in fig. 6.3; they are often called dust filled Friedman 
universes, or Friedman universes for short. Their evolution starts with the Big Bang 
(the instant t;, of the Big Bang has been accepted as the beginning of time counting, 
i.e., tin = 0). In the case k = 0 (the flat model), R — co and RO ast — oo. This 
solution is also called the Einstein-de Sitter model. If k = +1, the solution is a branch 
of the cycloid. The maximum occurs at tmax = (4/3)1?E and is Rmax = 8r E/3. The 
final singularity takes place at tg, = 877E/3. In the case k = —1, R— coand R- 1 
as t > 00. 


Fig. 6.3. Friedman’s world models. 
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5.2. Proposition. The scaling function R(t) of a Robertson-Walker space-time 
M =1 x pS filled with radiation (U, p, p = p/3) satisfies the equation 


7? Sele (5.2) 


where ¢ = pR?’ is a conserved magnitude on the entire interval /. ™ 

Proof. Equations (3.4) and (3.5), after straightforward manipulations taking 
into account that p = p/3, give equation (5.2). In this case, equation (3.6) turns out 
to be equivalent to d(pR*)dt = 0. 0 

The solutions of equation (5.2) are the following, 


k =0: R = (20t)?; 
cle R= (2bt — t?)?; 
k=-1: R = (2bt + t?)2. 


a 
where b = (87e/3)?. 
5.3. Corollary. If t — tin, all solutions behave as the flat solution, i.e., curvature 
effects become negligible. o 


6. Observational Background of the Standard Cosmological Model 


So far analyses of the present chapter had a purely mathematical character. Def- 
initions of observers, particles, perfect fluid etc. (see, sec. 2) were very formal and 
rather loosely connected with physical or astronomical practice. Now, the formalism 
must be put into resonance with the results of measurements and observations. In 
other words, mathematical definitions, theorems and corollaries must be changed into 
what is usually called a cosmological model. These procedure immediately relaxes 
the entire formal fabric. Even a high idealisation of physical objects (e.g., of galaxies 
to points in space or to curves in space-time, see below) impresses on mathematics 
a stigma of empirical indeterminacy. Therefore, in the present section we postpone 
our previous mathematical style of exposition and change into a more “narrative lan- 
guage”. Consequently, terms such as observer, particle, and so on, will be understood 
in their intuitive everyday meaning, and not in the sense of definitions 2.1—2.7. 

First of all, one should be aware of the kind of vicious circle in which every 
empirical testing of cosmological models is entangled (in fact, it is a feature of the 
empirical method as such, but in cosmology it could be more dangerous than in other 
branches of the empirical sciences). On the one hand, without the help of a theory 
or of a model one does not know what and how to measure, and in which way the 
obtained “data” should be interpreted. On the other hand, with no observational data 
one could hardly create a theory or a model, not to mention testing of their validity. 
This vicious circle has to be somehow broken down. The method — honoured by 
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practice and supported by surprising, albeit not quite understandable, successes — 
proceeds in the following way. At the starting point one accepts (usually tacitly) the 
simplest model; the simplest in this context means remaining in agreement with a 
scientific tradition or with intellectual habits. Conceptual tools such a model supplies 
allow one to gather and to interpret observational data. As long as consistency of 
interpreted data with the model can be maintained, there is no reason to look for 
another model. Only under the pressure of increasing inconsistencies and anomalies, 
scientists begin (not without a certain unwillingness) to revise the model, to change 
some of its assumptions, to revaluate some of its argumentation methods ... This 
process usually leads to a new model which guarantees, at least for the time being, 
agreement of the theory with observational data. If necessary, the history repeats 
itself. 

Cosmology of the twentieth century began its evolution from a traditional and 
intuitive model of a static flat and Euclidean space and an absolute time. Scientists 
believed that such a model was implied by the Newtonian theory of gravity (or at 
least it did not contradict the Newtonian theory) and remains in agreement with 
astronomical observations.! Gradually, observational data discarded this model. The 
standard cosmological model emerged as a better approximation to the reality; within 
its framework a satisfactory agreement has been reached between the theory and ob- 
servation. However, it cannot be thought of as “empirically proved” but only as a 
well-functioning model providing a consistent interpretation of empirical data. Ex- 
clusively in this sense, in the following we shall speak about observational verification 
of the standard world model. 

First of all, one must observationally verify definition 3.1 which constitutes the 
core of the standard model. If this definition is to be referred to the physical real- 
ity, the space-time M of the Universe is supposed to be of the form of the warped 
product M = / x RS, where S(t) are instantaneous spaces of constant curvature, 
and the parameter t € / represents the cosmic time. The empirical motivation for 
the isotropy and homogeneity of instantaneous spaces is provided by (A) measure- 
ments of the microwave background radiation and (B) investigations of the spatial 
distribution of galaxies. The information that R(t) # const is supplied by (C) red 
shift measurements in galactic spectra. The family of three-dimensional homoge- 
neous and isotropic spaces’ can be, in a natural way, parametrized by the parameter 
t € I; this defines the projection 7: M — I| which can be interpreted as the existence 
of the cosmic time in the model (see Appendix). In what follows we shall notice some 
difficulties which this apparently simple model encounters. 

(A) The microwave background radiation has the following properties interesting 
from the cosmological point of view (their mere enumeration allows one to see to which 
extent establishing of these properties is the result of both theory and observation. 


1 As it is shown in the Appendix, such a model is in fact not consistent with Newton’s theory. 
2More precisely, these spaces are defined by the action of certain symmetry groups, see, for 
example, MacCallum 1973. 
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(1) Observations are carried out at wavelengths 0.1-10cm. The radiation does 
not come from any identifiable discrete sources. Most probably, it was emitted “by the 
Universe” about 105-108 years after the Big Bang; before that date, called the epoch, 
or sometimes, more geometrically, the surface, of the last scattering, photon remained 
in equilibrium with other components of the primeval plasma. At the last scattering 
epoch the Universe became transparent for microwave photons; one speaks of photons 
decoupled from other forms of matter. If this interpretation is correct, the microwave 
background radiation carries the information about the Universe as regards how it 
was at the last scattering epoch. In other words, the background radiation allows us 
to penetrate backward up to 99.999 % of the evolution of the Universe. 

(2) The background radiation spectrum has, up to good approximation, a Planck 
spectrum of the black body at temperature T = 2.7K. This means that, at the epoch 
of emission of the background photons, the Universe was in the thermodynamical 
equilibrium. 

(3) The background radiation possesses a high degree of isotropy. Expected 
deviations from isotropy can be of a threefold kind: 

(a) anisotropy with a 24-hour period (dipole anisotropy); it is caused by the Earth 
motion with respect to the radiation field (once every 24 hours antenna crosses the 
direction in which the Earth moves with respect to the background radiation field); 

(b) anisotropy with a 12-hour period (quadrupole anisotropy); it could serve as 
a test of the anisotropic expansion of the Universe at the last scattering epoch (twice 
every 12 hours antenna crosses a privileged direction of the world’s expansion); 

(c) anisotropy on small scales would constitute an empirical proof of small scale 
inhomogeneities of the Universe at the last scattering surface (the radiation was scat- 
tered on inhomogeneities). 

The anisotropy with a 24-hour period has indeed been found. It turns out that 
— after taking into account corrections for the Earth’s motion around the Sun and 
the Sun’s motion around the centre of our Galaxy — one must ascribe to our Galaxy 
a speed of about 540km-s~' with respect to the radiation field (in the direction: 
a = 10",7, 6 = —22°). It is worth noticing that the background radiation field is a 
physical realization of the idea of a co-moving reference frame. 

The background radiation field has a high degree of isotropy of types (b) and (c). 
The upper limit of the temperature T anisotropy, at any scale, is AT/T < 1073, and 
for angular scales less than 3°, AT/T < 107°. The last result comes as a surprise since, 
according to present theories of formation of galaxies and cluster of galaxies, pregalac- 
tic condensations of matter should manifest themselves in small scale anisotropies of 
the background radiation. The actually measured degree of the isotropy puts severe 
constraints on theories of galaxy formation. 

To sum up, one must say that observational data concerning the microwave 
background radiation inform us about: (1) a high degree of its isotropy with respect 
to the Earthly observer (after taking into account the observer’s motion with respect 
to the co-moving frame); (2) homogeneity of matter distribution on the last scattering 
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surface (the lack of small scale inhomogeneities); (3) the isotropic expansion at the 
last scattering epoch (the lack of 12-hour anisotropies). The often overlooked fact 
is worth noticing; it is that the background photons carry information concerning 
not only the spatial isotropy of the matter distribution in the Universe, but also 
its spatial homogeneity (conclusion (2)). (More about cosmological significance of 
the microwave background radiation, see Peebles 1971, pp. 121-158; Weinberg 1972, 
pp. 506-228; Sciama 1971, pp. 176-203; Demiariski 1985, pp. 274-279; Raine 1981b, 
pp. 44-61; Bajtlik 1984; Kolb and Turner 1990, pp. 14-15.) Recent observational 
results and their analysis can be found in the thesis by Bajtlik (1984).)° 

(B) Investigations of the spatial distribution of galaxies is a difficult observational 
task. Till not long ago, distances to very few galaxies were known (with large mea- 
surement errors); without knowing the “third coordinate” one could only determine 
the projection of the galaxy distribution onto the celestial sphere. The reconstruction 
of the three-dimensional distribution from observational data (with no knowledge of 
the third coordinate) is not unique, and leads only to probable result. From among 
several methods used to reach the goal, the so-called correlation function method 
turned out to be the most effective one. Two-point correlation function measures 
the probability of finding (on the celestial sphere) two galaxies such that the distance 
between them is smaller than the average distance between two neighbouring galax- 
ies. Three-, four-, ..., and n-point correlation functions are defined analogously 
(for precise definitions, see Peebles 1980, pp. 143-152). Time and effort consuming 
investigations, carried out with the help of this method, have led to the following 
results: 

(1) For large angular distances the correlation function tends to zero. This means 
that the distribution of galaxies is homogeneous at large. Observations suggest that 
the world can be regarded as spatially homogeneous at distance scales larger than 
108 ps.* 

(2) On smaller scales a strong effect occurs of the clustering of galaxies. This 
effect remains in agreement with the well-established existence of smaller aggregations 
of galaxies (such as the Local Group), clusters, and superclusters of galaxies. 

(3) The investigations do not show any characteristic scale of clustering. This 
means that there is no privileged unit (cluster of galaxies, say) which could be re- 
garded as a fundamental building-block of the Universe. The average distribution of 
such building blocks would implement the observed large scale distribution of matter. 
This suggests the hypothesis that there is clustering of galaxies on all scales: smaller 
aggregates tend to be clustered into larger ones, but this tendency is less and less 
pronounced as one goes to larger and larger distances. 


3The above presented results have been remarkably improved by the very successful satellite 
COBE (COsmic Background Explorer) launched in November 1989. Preliminary measurements fit 
perfectly to a Planck blackbody radiation spectrum for a temperature of 2.735 + 0.06 kK. At the 1% 
level there are no deviations from an ideal blackbody spectrum. Dipole and quadrupole anisotropies 
are not observed. See B. Schwarzschild, Phystcs Today, 43, 1990, 17-20. (Note added in proof.) 

41 ps = 206265 AU = 3.263 light years = 3.083 x 10'%km (AU stands for Astronomical Unit). 
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Many red shift measurements of galactic spectra have been recently obtained 
(see below (C)). By assuming that the red shifts provide information on distances to 
galaxies, the lacking third coordinate can be determined for a considerable number 
of galaxies, and three-dimensional charts of the sky can be constructed. This kind of 
research has confirmed the previous results but, quite unexpectedly, revealed a pecu- 
liar structure of clustering on smaller scales. It turns out that galaxies are clustering 
in elongated membrane structures the linear size of which is of order of magnitude 
larger than the transversal one. Between membranes crossing each other in an irregu- 
lar manner, large domains extend devoid of galaxies. Within the membranes galaxies 
are distributed homogeneously with a tendency of clustering on all scales. (See Silk 
et al. 1983; more about distribution of galaxies in the Universe, Peebles 1980; Groth 
et al. 1977; Raine 1971b, pp. 1-33, 62-83; Kolb and Turner 1990, pp. 16-26.) 

Important cosmological information can be obtained by comparing empirical re- 
sults concerning the microwave background radiation and the distribution of galaxies. 
In this way, one can get, as it were, two photographs of the Universe: the background 
radiation conveys information about the structure of the last scattering surface, the 
distribution of galaxies—-about the “present” cosmological epoch. It turns out that 
the evolution leads the Universe towards the destruction of homogeneity. This effect 
should be ascribed to a strongly unstable character of gravitational field which con- 
sists in a tendency to strengthen all casual matter inhomogeneities. Onto this general 
trend other physical mechanisms are superimposed the nature of which has not been 
fully understood. The origin and evolution of structures in the Universe is one of the 
most difficult problems of contemporary cosmology. 

(C) Red shifts of the spectra of galaxies were the first significant observable of 
the relativistic cosmology, it played an exceptional role in its history. Now, we will 
show how red shifts are connected with theoretical concepts appearing in the standard 
model. 

Let us consider Robertson—Walker line element (3.1)-(3.2b). Let us assume that 
at an instant ¢, of the cosmological time a photon is emitted from a galaxy situated 
at the point (r;,41,¢1) in the direction r (i.e., 6 = y = 0). The photon is registered 
on the Earth, situated at (0,0,0), at the instant to. Taking into account that light 
moves along null geodesics, one obtains 


Be: a 
20 cane ye 
1 4 


Since the coordinates (71,1, 1) of the galaxy are co-moving coordinates, f(r;) 
does not depend on time; it can be thought of as measuring a conventional distance 
corresponding to the time difference to — t; (conventional since it is not directly 
measurable). If the subsequent photon is emitted from the galaxy at an instant 
t; + At, and registered on the Earth at to + Ato with At,, Ato < to — t;, one has 


to+Ato dt 
ik. R(t) = FGi); 
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The last formula is often expressed in terms of the wavelength \, (or frequency 
1) of the emitted light and of the wavelength Ao (or frequency vo) of the registered 
light; in such a case we have 
Ai Vo wt: R(ty) 
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Customarily, one defines the red shift parameter (called also red shift, for short) 


z = (Ao — Ax)/A; which gives 


=. (6.2) 


If the Universe expands, R(to) > R(t,) and z is positive, i.e., the registered light 
wave is shorter than the emitted one. If the Universe contracts, the situation is just 
the opposite, in which case one obtains blue shift. By expanding (6.1) and (6.2) 
with respect to the parameter h = (tp — t;)/R(to), which is assumed to be small (i.e. 
one could hope that the procedure is valid for not too distant galaxies), and then by 
eliminating this parameter from expansions of (6.1) and (6.2), one obtains 


z = R(to)f(r1). 


This gives a linear dependence of red shift on the “distance” f(r,). Distances, 
actually measured by astronomers (see Weinberg 1972, pp. 418-427) turn out to be 
proportional to R(to)f(r1), symbolically d = R(to)f(r1), i-e., 


d. (6.3) 


This is the famous Hubble’s law, discovered observationally by this astronomer in 1929. 
It was this law that, before the discovery of the background radiation, constituted 
the main argument of the “expanding Universe”. 

Before the discovery of the microwave background radiation the so-called clas- 
sical cosmological tests were expected to provide relevant observational information. 
The idea consists in identifying actually measured astronomical quantities (sometimes 
called observables) with theoretical magnitudes appearing in a model, and comparing 
dependencies between pairs of them, as predicted by various models, with those estab- 
lished experimentally. The most important dependencies are: red shift — angular size 
of objects, red shift — stellar magnitude, counting of objects up to a certain limiting 
magnitude® (see for instance, Weinberg 1972, pp. 441-458; Raine 1981b, pp. 153- 
166). However, so far the efficiency of these tests has proved to be rather limited. 


5In cosmological tests magnitude often serves as a distance indicator. 
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They are based on a number of in principle unverifiable assumptions which have to 
fill in gaps in our knowledge, especially as far as the evolution of objects is concerned 
(such as galaxies, radiogalaxies, quasars). By maneuvering “free parameters” many 
different cosmological models can be fitted to actually observed dependencies. The 
most significant result obtained with the help of classical tests is the support given 
by them to the idea of the evolutionary Universe against the steady-state cosmology 
of Bondi and Gold (1948), and Hoyle (1948). 

To sum up, one must say that the standard world model presented in the present 
chapter does not contradict any observational data available at present and, although 
none of known tests “proves” it directly, all tests together strongly suggest that the 
standard model — up to a good approximation — correctly describes the observed 
Universe. 

We should notice another important circumstance as far as observational aspects 
of cosmology are concerned. Because of the final velocity of light signals, the so-called 
horizon may appear in the Universe limiting the field of vision of a given observer. 
At a moment ¢, an observer situated at the beginning of the co-moving coordinate 
system, r = 0, can receive signals emitted by galaxies (more strictly by particles 
at rest with respect to the co-moving coordinate system), the radial coordinate of 
which, r, is smaller than r;, where r, is the coordinate of a galaxy such that the light 
signal emitted from it at t, reaches the observer (situated at r = 0) at t. t, can be 
computed from the formula (see formula (6.1)) 


a dr ie dt a 
o i+tr? J, R(t) (6.4) 


If the integral on the right-hand side of (6.4) converges for t; — 0 (or in world models 
without the initial singularity, for t; > —oo), then there exist galaxies such that light 
signals emitted from them do not have enough time to reach the observer at r = 0; 
such a model is said to have a particle horizon. 

It can also happen that the integral on the right-hand side of formula (6.4) 
converges for t > tmax (tmax can be equal to +00 or to the moment of the final 
singularity); if this is the case, there exist galaxies such that light signals emitted 
from them will never reach the observer situated at r = 0; such a model is said to 
have an event horizon. (More about horizons, see Rindler 1946; 1969, pp. 240-242; 
Weinberg 1972, pp. 489-491; Hawking and Ellis 1973, pp. 127-130.) 

The existence of horizons imposes strong constraints upon observational possibil- 
ities of cosmology. If in our Universe there are horizons, and if we want to extrapolate 
our knowledge acquired within the horizons to regions situated outside them, we have 
to make strong assumptions that would justify such a procedure. Assumptions of this 
kind will always be of an empirically unverifiable character. 
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A. After having made the first contact with the standard cosmological model, one is 
struck by a certain tension between the degree of idealization incorporated into this 
model and the agreement of its predictions with observational data. Definitions of 
particle, photon, etc. from the beginning of sec. 2 allow one to “theoretically identify” 
the material content of the standard model; however, to coordinate physical objects to 
particular mathematical structures becomes a very difficult task. For instance, in the 
standard model, density of particles with non-zero rest-imass satisfies the relationship 
Pmat ~ R-%(t), whereas for particles with zero rest-mass one has praq ~ R74(t) 
(number of photons in the volume unit changes as R~%, and the energy of each 
photon as R~'). If, therefore, in the present Universe, there is even a slight amount 
of radiation, there had to be in its past an epoch (when R was sufficiently small) in 
which the radiation dominated the dynamics of the world. If the present Universe is 
correctly described by a dust-filled model with even a slight admixture of radiation, 
its sufficiently distant past must be described by a radiation-filled model. 

In its “dust period” the model is filled with dust particles, i.e., with timelike 
geodesics (definition 2.2), but it is less clear with which physical objects a “theoretical 
dust particle” should be identified. From the theoretical point of view, dust particles 
are distributed throughout the space of the model in a homogeneous and isotropic 
manner, and consequently one must look for real objects on the level of which the 
spatial homogeneity and isotropy would be guaranteed. Galaxies for sure, and clusters 
of galaxies most probably, are not such objects. If a version of the hypothesis is true 
of clustering of galaxies on all scales, it might turn out that such objects simply do 
not exist. Having this in mind, successes of the standard world model in organizing 
observational material are really astonishing. (For a more detailed discussion, see 


Heller 1982.) 


B. All solutions of Einstein’s equations, considered in the present chapter, have 
been obtained with the assumption that the cosmological constant A is equal to zero 
(see chapter 4, sec. 5). By rejecting this assumption one obtains a much richer class of 
solutions. Counterparts of equations (3.3) and (3.4) with non vanishing cosmological 
constant are the following, 


81 R? k l 
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Solutions of these equations, for different values of the parameters A and k, with 
the equation of state p = 0 (dust), are shown in fig. 6.4 and 6.5; the table gives 
their straightforward classification. (For more details, see Tolman 1934, pp. 394-419; 
Rindler 1977, pp. 235-238; Weinberg 1972, pp. 613-616.) 
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Fig. 6.4. World models with k = +1, p = 0, for different values 
of the cosmological constant. 


Einstein (1917) introduced the cosmological constant A to obtain a static world 
model; however, after finally convincing himself that the Universe really expands, he 
considered it to be superfluous (see Einstein 1958). Observational data testify that 
the influence of the cosmological constant upon the dynamic of the planetary system 
is negligible. The interest in cosmological models with the cosmological constant 
increased around 1967 when a surplus of number was discovered of quasars with 
red shifts about z = 2. It seemed that this suggested the existence of an epoch in 
which the rate of expansion slowed down almost to zero. Such an epoch appears only 
if the value of the cosmological constant A is slightly larger than Ag (the so-called 
Lemaitre’s models, see model M, in fig. 6.4). When afterwards it had turned out that 
it was only a kind of a selection effect, models with A = 0 became again fashionable 
(see Petrosian 1974). 


Fig. 6.5. World models with k = 0, p = 0, for different values 
of the cosmological constant. 
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Table. World models with the cosmological constant. 


== 
Models M2 


M, — monotonically expanding 
models with the initial singularity. 
M, — de S ast + 00 
R< Rg: A; — first asymptotic 
model. A, — E as t > co 
R= Rg: E — static Einstein’s 
model. 
R> Rg: Az — second asymptotic 
model. A, > E as t + co 
R < Rg: O — oscillating models 
with the initial and final singularities 
R> Re: Mz — monotonically 
expanding models without the 
initial singularity. 
Mz — de S ast > 00 


Models O 
Models O Models O 


de S denotes de Sitter’s world model; Ag and Rp are values of the cosmological constant and of the 
scale factor characteristic for the static Einstein’s model, correspondingly. 


Models M, 


The cosmological constant has returned to scientific papers in connection with 
grand unification theories. It was known for a long time that dynamical effects of 
the cosmological constant appear as “corrections” to the density and pressure: per = 
p+ A/(8r), per = p — A/(87) (see Weinberg 1972, p. 614). The energy density of 
the vacuum presupposed by unifying theories is thought of as being connected with 
these corrections (see, for instance, Kazanas 1980, Abbot 1988). 


C. Einstein’s equations, being partial differential equations, determine the struc- 
ture of space-time only locally. In particular, they do not contain any global informa- 
tion about space-time topology. For instance, for a given space time (M, gq), there 
exist a simply connected universal covering manifold’ M equipped with the metric g 
such that (M,g) can be obtained from (M,g) by suitably identifying points in M. 
If a solution of Einstein’s equations is given, it is always possible to find its universal 
covering (M,g), and to perform all possible identifications in it (if space time has 
no symmetries its universal covering coincides with it). Space-times obtained in this 


®To obtain a static solution from the field equation without the cosmological constants requires 
a negative pressure. The cosmological constant was introduced by Einstein in order to avoid this 
“non-physical” effect. 

"Definitions of these concept can be found, for example, in a book by O'Neill (1983, pp. 441 445). 


104 ; 7. Comments and Remarks 


way are automatically solutions of Einstein’s equations, but they can drastically differ 
from each other from the topological point of view. In sec. 3 of the present chap- 
ter, it has been said that the standard choice of topology for instantaneous spaces 
are hyperbolic space H? for k = —1, Euclidean space R? for k = 0, and spheri- 
cal space S* for k = +1. However, the problem could be stated in a more general 
way, namely, one could ask about all possible space-times which locally are expand- 
ing Robertson—-Walker worlds with instantaneous spaces of constant curvature. The 
answer to this question gives globally different cosmological models that locally are 
non-distinguishable from each other (and from the standard model) as far as their 
observational aspects are concerned. Global differences between these models could 
manifest themselves in the possibility of observing the same object more than once 
on the celestial sphere (the so-called ghost images). 

Mathematically, the question is reduced to the classical problem of Clifford—Kiein 
space forms (see Wolf 1967, and in a more concise way O’Neill 1983, pp. 243-248). It 
was Ellis (1971) who considered this problem in a cosmological context. It turns out 
that there exist infinitely many space forms® locally reproducing Robertson—Walker 
universes. In the case k = +1, all these space forms are compact but in the cases 


k = 0 and k = —1, there exist both compact and non-compact space forms. This 
means that not all world models with k = 0 and k = —1 should be regarded as 
“open”. 


This problem is strictly connected with the extrapolation problem in cosmology 
which essentially reduces itself to the question: how spaces, determined locally, could 
be extended to non-local scales (see Heller 1978)? Some observational aspects of this 
problem have been discussed by Ellis (1983). 


D. The existence of horizons in some world models (see end of sec. 6) also leads 
to difficult cosmological problems. Let us notice that two domains of space-time 
separated by a particle horizon never have interacted with each other. It can be 
shown that domains, distant from each other by no more than 75° on the celestial 
sphere, were separated by the particle horizon at the epoch at which the microwave 
background radiation interacted for the last time with other forms of matter filling the 
Universe (see, for example, Weinberg 1972, pp. 525-526). If one admits that physical 
interactions smoothing out the Universe cannot propagate with a speed greater than 
that of light, one cannot see any reason why the background radiation, coming from 
domains separated from each other by a particle horizon during the last scattering 
epoch, is now isotropic to such a high degree. 

If one postpones looking for a physical mechanism explaining this puzzle, one has 
to assume very special initial conditions for the Universe which would a priori syn- 
chronize cosmological parameters (such as temperature of the background radiation) 
in regions separated by particle horizons from the very beginning. 

A program aiming at explaining the question of the “smoothness of the Universe” 


8A complete and connected Riemann manifold of constant curvature is said to be a ( Clifford- 
Klein) space form. 
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was proposed by Misner (1969). The idea was to find such mechanisms with the 
help of which it would be possible “to deduce” the actually observed state of the 
Universe from arbitrary initial conditions (this program has been named the miz- 
master cosmology). To this end one must (1) find world models in which a light 
signal would be able to go many times around the world till the present epoch, and 
(2) identify physical mechanisms which could effectively smooth out the Universe. 
Different mechanisms were proposed: among others dissipative effects connected with 
the viscosity of neutrinos or with the quantum process of the pair particle-antiparticle 
creation in strong gravitational fields in the early stages of world’s evolution. The 
heated discussion, which developed around the mix—master cosmology program, has 
shown that to satisfy condition (1) very special initial data are also required; therefore, 
it is not a good strategy to avoid special conditions. As far as condition (2) is 
concerned, it has turned out that the proposed mechanisms are either not efficient 
enough (neutrino viscosity) or too hypothetical (pair creation). In effect, the mix- 
master program is believed to be attractive but involved in so far insurmountable 
difficulties (see Barrow and Matzner 1977, Barrow 1978, Misner 1980, Raine 1981b, 
pp. 215-220). 

New perspectives to solve the puzzle appeared in connection with the grand 
unification theories. Guth (1981) noticed that the phase transition from the early 
symmetric Universe to the Universe with the spontaneously broken symmetry can be 
associated with a short period of rapid (exponential) expansion enlarging the size of 
the Universe 10°° times (or more). “Inflation” of this kind is caused by the vacuum 
energy which, during the phase transition, dominates the cosmological dynamics. 
This scenario, called the inflationary universe, was afterward modified several times 
(see Linde 1982, Hawking and Moss 1982, Barrow and Turner 1982). Inflation seems 
to liquidate the horizon problem: because of the exponential] expansion in its history, 
the Universe, observed at present, was never divided by particle horizons; it had 
originated from a tiny portion of the primordial plasma blown up to enormous size 
during the inflation period. 

Great hopes were connected with inflationary scenarios. However, the enthusi- 
asm which surrounded early versions of the model has recently slowed down. Some 
researches point out that physics, presupposed by inflation, is still dubious, and for 
solutions of some problems one is paid with new, and even more serious, problems 


(see Ellis and Stoeger 1988). 
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Bibliographical Essay 


Reading of the present book does not give an idea of what the contemporary cosmol- 
ogy really is. The book’s aim has been only to pose the cosmological problem and to 
discuss its fundamental assumptions. If the reader feels a little disappointed that he 
did not acquire the expected cosmological knowledge, the author is entitled to think 
that he has succeeded to reach his goals. However, wanting to help the reader in 
his further adventures with cosmology, the author feels obliged to add to this book a 
kind of guide throught the land of cosmological readings. It should be stressed that 
it is by no means a bibliography which could pretend to even partial completeness. 
The term “guide” seems to be a just expression; it is a guide, and — it should be 
added — a subjective one, to a great extent reflecting pathways travelled once by the 
author himself. 

The guide contains only references to the literature in the field of relativistic 
cosmology; books dealing with other cosmologies (such as neo-Newtonian, Milne’s 
or steady-state cosmology) are not included. With only one exception (Robertson’s 
review article of 1933) the reader will not find in it any reference to scientific papers 
published in journals. Because of the great richness and variety of such papers any 
attempt to compose a guide through them, or a list of them, would be both groundless 
and impossible. 

For reasons of clarity, cosmological literature will be divided into sections reflect- 
ing main stages of the evolution of cosmology in our century. 


A. Classical Period (from the origin of relativistic cosmology 


to the Second World War). 


First textbooks of general relativity appeared surprisingly early. To the earliest ones 
belong: 

H. Weyl (1922), Space — Time — Matter (German edition in 1921); 

A.5. Eddington (1923), The Mathematical Theory of Relativity. 

As far as mathematical standards are concerned, both these books are equal 
to many later publications, and the depth of their philosophical insight surpasses 
many newest elaborations. Numerous comments in these books are devoted to the 
problem of the relativity of space and time; the problem was still new and claimed for 
elucidation. Only some sections deal with “the Universe as a whole”; they represent 
Einsteins’s “cylindrical” world and de Sitter’s “spherical” universe. The fact that the 
two world models exist fascinates the authors. For the first time space-time begins 
to reveal its global image. 

About 1930, owing mainly to the works of Friedman, Lemaitre and Robertson, 
all spatially homogeneous and isotropic cosmological models were known. The first 
complete review of them can be found in the paper: 
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H.P. Robertson (1933), Relativistic Cosmology, 
published in the journal Reviews of Modern Physics. The article is important for 
two reasons: first, a few generations of physicists learned from it basic facts of rela- 
tivistic cosmology; second, appendix to the paper contains an almost complete list of 
cosmological papers published in 1917-1932. 

An extensive monographic textbook: 

R.C. Tolman (1934), Relativity, Thermodynamics and Cosmology, 
contains a detailed presentation of special and general relativity; in this aspect it 
is certainly a kind of textbook. Chapters that present pioneering works of Tolman 
himself are doubtlessly of a monographic character; they create the fundamentals of 
relativistic thermodynamics. The cosmological part of the book, in its composition, is 
similar to the paper by Robertson of 1933 but it puts more stress on the confrontation 
of theoretical models with astronomical observations. Applications of relativistic 
thermodynamics to cosmology are typical for this book. The clarity of Tolman’s 
work is exceptional. Even today scientists open it to check a formula or to find a 
forgotten, but still useful, mathematical expression. 

One should also mention another book which — although published after the 
Second World War — by its spirit belongs to the pre-war period: 

G. Lemaitre (1972), L’Hypothese de l’atome primitif (first edition in 1946; En- 
glish translation in 1950). 

The book is a collection of popular essays, published earlier by the author on 
various occasions. The value of the book lies in the fact that it shows the whole 
cosmological vision of Lemaitre. The hypothesis of the primeval atom was a first 
attempt to create an overwhelming cosmological system (from the initial singularity, 
through a cosmological theory of nucleosynthesis, to a theory of the origin of galaxies 
and of clusters of galaxies). In this hypothesis, for the first time, geometric theory of 
the world was supplemented with a reconstruction of physics of the early Universe. 

The book: 

O. Heckmann (1968), Theorien der Kosmologie (first edition in 1942) 
closes this period. In the exposition style it follows Robertson’s paper (1933). It also 
contains a list of cosmological publications of 1933-1940. 


B. Period of Stagnation (from the Second World War to the discovery 
of the background radiation in 1965). 


First an explanation. The term “stagnation” implies a certain slowing down of 
progress in cosmology, but it does not apply to the theory of relativity which had 
many successes in this period especially as far as its mathematical aspects were con- 
cerned. Of course, all these successes were relevant also for cosmology. Speaking about 
stagnation I have in mind a certain impasse in cosmological observations. In the fifti- 
eth discussion focused on polemics between defenders of the relativistic paradigm 
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(which was then called the “theory of the expanding Universe”) with the proponents 
of the steady-state cosmology, created in 1948 by Bondi and Gold, and Hoyle. 

A note of pessimism can be heard as early as in 1945 in the Cosmological Ap- 
pendix added by Einstein to the second edition of his The Meaning of Relativity (first 
edition in 1921). A small number of publications in the field of cosmology that are 
to be quoted in the present section also testifies about the stagnation. We should 
certainly mention 

H. Bondi (1960), Cosmology (editions in 1952, 1960, 1961). 

The book presents “on equal footing”: neo-Newtonian cosmology, relativistic 
cosmology, Milne’s cosmology, and steady-state cosmology. Cosmological conceptions 
of Eddington, Dirac, and Jordan are also briefly reported. A transparent discussion of 
the conceptual and methodological basis of cosmology makes this book still currently 
relevant. 

The following book, on the one hand, is a kind of summing up of formal achieve- 
ments of the pre-war cosmology but, on the other hand, contains a number of theo- 
retical results obtained already in this period: 

H.P. Robertson and T. W. Noonan (1968), Relativity and Cosmology. 

In spite of the date of its publication, this book doubtlessly belongs to the period 
before the discovery of the background radiation. After the tragic death of Robertson 
(in a traffic accident), it was compiled, from papers and notes left by him, by his last 
student, Thomas W. Noonan. The book was very carefully edited;! it shows the 
Robertsonian style of doing cosmology, in which a detailed analysis of space-time 
symmetries is as important as solving the field equations. 


C. Period of Dynamical Progress (after the discovery 
of the background radiation). 


The discovery of the microwave background radiation in 1965 together with other 
achievements of radioastronomy (quasars, radiogalaxies, pulsars) and later on also of 
optical astronomy brought new momentum into relativistic physics and especially into 
cosmology. Relatively quickly, the standard model began to consolidate. From the 
beginning of the seventieth the number of books in cosmology continuously increases. 
Only some of them can be quoted. Because of their great variety, I shall classify them 
into certain groups. 


1. Textbooks of general relativity. It is now a custom to devote a chapter or 
(more often as time passes) an entire part to cosmology. A handbook which has 
gained considerable publicity is 


C.W. Misner, K. Thorn, J.A. Wheeler (1973), Gravitation. 


At the end of the book the reader can find summaries of all chapters. The summary ends with 
the summary of the summary. . 
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It is a big volume (1279 pages). The style is highly didactic with the ambition 
to teach a beginner all the mysteries and intricacies indispensable to start research in 
this field. (In my opinion, didactic goals of this book are a bit exaggerated.) Many 
pages are devoted to cosmology, but a serious drawback (in my opinion again) is 
that the authors, motivated by philosophical reasons, almost exclusively focused on 
“closed” world models (with k = +1). 

The more recent book: 

B. Schutz, A First Course in General Relativity 
bears some similarities to Misner, Thorn and Wheeler’s volume, but is much more 
concise, easier, and with no tricky gadgets. It introduces the reader to understand- 
ing the content of the theory presented and to manipulating with its computational 
machinery. This book is my personal recommendation for an ambitious freshman. 

From among many other handbooks I would recommend the following to a reader 
who is prepared to make a more intimate acquaintance with cosmology: 

S. Weinberg (1972), Gravitation and Cosmology: Principles and Applications of 
the General Theory of Relativity. 

The fifth part of this book deals with cosmology. In my view, only two features 
separate the book from the ideal: first, the author unnecessarily avoids the modern 
coordination free notation; second, the book was written several years ago, and in 
a few points it would require modernization. Nevertheless, the principles of physics 
do not change so rapidly and Weinberg’s book is still an excellent source to learn 
relativistic physics and cosmology. 

Let us also quote: 

W. Rindler (1968, 1977), Essential Relativity: Special, General, and Cosmologi- 
cal. 

R. Adler, M. Bazin, M. Schiffer (1965), Introduction to General Relativity. 

The first of these books, providing more imaginative pictures, focuses on the 
problem of the relativity of space and time (Mach’s principle), and from the second 
the reader can learn many useful mathematical techniques. 

L.D. Landau, E.M. Liphschitz (1973), The Field Theory (in Russian). 

This edition of the famous book has a significantly enlarged cosmological part as 
compared with previous editions. A serious drawback, from the cosmological point 
of view, is the absence of observational aspects of cosmology. A chapter devoted to 
cosmology can be found in the book 

M. Demianski (1985), Relativistic Astrophysics. 

The following book is of an exceptional character, 

R.S. Sachs and H. Wu (1977b), General Relativity for Mathematicians. 

It adopts a quasi mathematical style of presentation. However, mathematical 
rigors break down when a physical reality has to be put into formulae. In the part 
dealing with cosmology, the Einstein-de Sitter world model is mainly exploited (it is 
a spatially isotropic and homogeneous world model with k = p = A = 0). 

Very useful for every relativity student is the collection of exercises: 
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A.P. Lightman, W.H. Press, R.H. Price and S.A. Teukolsky (1975), Problem 
Book in Relativity and Gravitation. 
Chapter 19 is on cosmology (solutions to problems are included). 


2. Handbooks of cosmology. a) Elementary level: 

(1) Understanding Space and Time, Block 5: Cosmology (1979), 

(2) M. Berry (1976), Principles of Cosmology and Gravitation, 

(3) M. Rowan-Robinson (1977), Cosmology, 

(4) D.W. Sciama (1975), Modern Cosmology, 

(5) J. Heidmann (1973), Introduction a la cosmologie, 

(6) P.T. Landsberg, D.A. Evans (1977), Mathematical Cosmology — An Intro- 
duction. 

The above books are enumerated, roughly speaking, in the order from the easier 
to the more difficult. I would advise one to make the first encounter with cosmology 
with the help of (1) or (2). (1) is a publication of the British “Open University”, 
(2) is an excellent textbook for beginners. (3) deals more with the observational 
side of cosmology, and (4) nicely presents mutual dependencies of cosmology and 
astrophysics. (5) is a sample of the French approach to cosmology. By using (6) one 
can acquire a proficiency in “mathematical cosmology” without going beyond the 
fundamental level of higher mathematics. Although the book deals, in principle, with 
the neo-Newtonian cosmology only, it gives a good picture of cosmological research. 

b) Advanced level: 

(1) D.J. Raine (1981b), The Isotropic Universe, 

(2) P.J.E. Peebles (1971), Physical Cosmology, 

(3) J.B. Zeldovitch and I.D. Novikov (1975), Structure and Evolution of the 
Universe (in Russian). 

(4) A.D. Dolgov, I.B. Zeldovitch and M.B. Sagin (1988), Cosmology of the Early 
Universe (in Russian). 

(5) M.P. Ryan and L.C. Shepley (1975), Homogeneous Relativistic Cosmologies. 

These books are also enumerated in the order of increasing difficulty. (1) is 
a good introduction to, and overview of, cosmology. (2) and (3) are attempts at 
reconstructing physical processes in the evolving Universe. (3) is a mosaic of var- 
ious cosmological problems rather than a textbook of cosmology, albeit a mosaic 
providing the general picture. (4) originated from courses given by Zeldovitch at 
the Moscow University, and was composed by his co-workers after the death of this 
eminent cosmologist; it gives a very penetrating account of recent problems and re- 
sults in cosmology, such as: inflation, grand unification, quantum gravity, creation of 
the Universe as a quantum tunneling process, ... (5) is mathematically elegant and 
explores theoretical aspects of Bianchi cosmology (homogeneous but not necessarily 
isotropic world models). This book contains a rich catalogue of cosmological publi- 
cations till 1974, which can be considered as a continuation of similar lists composed 


by Robertson (1933) and Heckmann (1968) NOW EV eH now without any ambition of 
completeness). 
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3. Monographs. 

(1) P.J.E. Peebles (1980), The Large Scale Structure of the Universe, 

(2) S.W. Hawking, G.F.R. Ellis (1973), The Large-Scale Structure of Space- 
Time, 

(3) J.K. Beem, P.E. Ehrlich (1981), Global Lorentzian Geometry, 

(4) D. Kramer, H. Stephani, E. Herlt, M. MacCallum (1980), Exact Solutions of 
Einstein’s Field Equations. 

(1) deals with the problem of the origin and evolution of structures in the Uni- 
verse (such as galaxies, clusters of galaxies, ...). Works which gave birth to book (2) 
have initiated many new research programs in relativity: singularity theorems, causal 
structure of space-time, global structure of solutions to Einstein’s field equations, and 
so on. This monograph closes a certain stage of discussions concerning the classical 
(non—quantum) singularity problem. (3) elaborates principally the same problems as 
(2) but in a more mathematical style. (4) is an indispensable source for studying 
solutions of Einstein’s equations. 


4. Books on the XX** century history of cosmology. 

(1) J.D. North (1965), The Measure of the Universe, 

(2) J. Merleau-Ponty (1965), Cosmologie du XX* siecle, 

(3) O. Godart and M. Heller (1985), Cosmology of Lemaitre. 

(4) E.A. Tropp, W.Ya. Frenkel, A.D. Chernin (1988), Alerander Alerandrovitch 


Friedman (in Russian). 


5. Books on philosophical aspects of relativity and cosmology. 

(1) J.D. Barrow and F.J. Tipler (1985), The Anthropic Cosmological Principle, 
(2) D.J. Raine and M. Heller (1981), The Science of Space-Time, 

(3) R. Toretti (1983), Relativity and Geometry, 

(4) M. Friedman (1983), Foundations of Space-Time Theories, 

(5) J.D. Barrow (1988), The World Within the World, 

(6) M. Heller (1986), Questions to the Universe. 


6. Popular books on cosmology. Since a great number of such books appears 
each year, I choose, more or less at random, only a few: 

(1) J. Silk (1980), The Big Bang. Creation and Evolution of the Universe, 

(2) E.R. Harrison (1981), Cosmology — The Science of the Universe, 

(3) S. Weinberg (1977), The First Three Minutes, 

(4) R. Geroch (1978), General Relativity from A to B, 

(5) A.G. Pacholezyk (1984), The Catastrophic Universe, 

(6) J. Silk and J.D. Barrow (1983), The Left Hand of Creation, 

(7) J.S. Trefil (1983), The Moment of Creation. 


My personal recommendation would be (1), (6) and (7). 
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Appendix 


Evolution of Dynamical Space—Time Theories 


0. Introduction 


General relativity is something more in contemporary physics than one of the many 
theories describing a certain domain of the world. A distinguished role of Einstein’s 
theory of gravity can be reduced to two circumstances: first, this theory provides a 
space-time stage to many other physical theories and, second, it does so not only 
locally, in the neighbourhood of an investigated event, but also globally when, for 
instance, it deals with inextendible space-times, i.e., with such space-times which 
cannot be embedded into “larger” ones. Doubtlessly, the last function of general 
relativity appeals to aesthetic feelings of physicists; however, its significance in our 
system of knowledge is both more prosaic and more important: without it the correct 
statement of many concrete problems of physics and astronomy would be impossi- 
ble. Contemporary observational astronomy controls such extensive regions of the 
Universe that with no help of global (or at least large scale) theory of space-time 
there could be no question of distilling any information carried by signals travelling 
through space and time. We have also seen in previous chapters how cosmology be- 
comes unavoidable as far as the program of unification of physics is concerned (see 
chapter 3, sec. 0). 

Science always tended towards a synthesis of its various theories. In previous 
paradigms, there were dynamical theories of mechanics that played the role of prepar- 
ing spatio-temporal stage for eventual cosmological synthesis. This was the case with 
both Aristotelian and Newtonian images of the world. In contemporary science this 
function has been taken over, and much sharpened, by general relativity. 

After having read all previous chapters, only a slight mathematical preparation 
would be enough in order to see, in a more detailed way, how the evolution of views 
on the space-time structure has led to the present distinguished position of general 
relativity in science. This could serve us as an additional argument on behalf of this 
theory — the argument from a “logic of development” of science. Since, on the one 
hand, analyses of this kind do not fit the main line of reasoning adopted in this book 
and, on the other hand, it would be a great negligence to omit an occasion to present 
the place of general relativity in the evolution of our views on the Universe altogether, 
we will do this in the present Appendix. 

To do so, we must prepare certain concepts from the mathematical theory of 
categories. This theory allows one to organise large parts of contemporary mathe- 
matics in the spirit similar to that of the Erlangen Program proposed by F. Klein as a 
framework for unifying different geometric theories. Some mathematicians share the 
opinion that this announces a revolution in mathematics analogous to that initiated 
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by Cantor’s work (see Semadeni and Wiweger 1978, p. 13). No wonder, therefore, 
that the application of the categorial style of thinking to the analysis of physical 
theories not only turns out to be an attractive possibility, but also begins to bring 
fruits in the form of significant sharpening of some concepts and of revealing logical 
dependence between present physical theories and their predecessors (see Trautman 
1973b). 

Finally, one should stress that analyses carried out in the present appendix are 
of no strict historical character; they are rather a stylization of the history of sci- 
ence: they try to look at former theories in the light, and with the help, of modern 
mathematical tools. Such an approach enables one better to see a logic of the evolu- 
tion of science, and consequently, more responsibly to predict perspectives of further 
development. 

Such analyses have been carried out by many authors (see bibliographical note 
at the end of the appendix). Of course, we shall make ample use of them (especially 
of our own study, see Raine and Heller 1981); however, we are in a better position 
than all previous authors inasmuch as we can base our analysis upon an advanced 
mathematical machinery prepared throughout the entire book. Owing to this, we are 
able to significantly shorten the “explanatory text” and provide a deeper insight into 
the nature of analysed problems. 

After presenting elementary concepts of the mathematical category theory (in 
sec. 1), we shall present the structure and dynamics of space-time in the Aristotelian 
theory (sec. 2), in classical mechanics without gravitational field (sec. 3), and in 
classical mechanics with gravitational field (sec. 4). The evolution of these space- 
time structures naturally leads to special and general relativity theories (sec. 5). 
Some branching from the main evolutionary trunk, namely certain ideas coming from 
Berkeley, Leibniz and Mach, together with prospects for future progress, will be 
presented as Comments and Remarks closing the Appendix (sec. 6). 


1. Elementary Repetition of the Category Theory 


1.1. Definition. We say that a category A is given if there are 

(1) aclass A®° (not necessarily a set'), elements of which are called objects; 

(2) a function which to every ordered pair of objects (4’, ¥), A, Y € A® ascribes a set 
Mor (4, Y), elements of which are called morphisms from 4 to Y; additionally 

(3) a composition of morphism 


Mor (4’, ) x Mor(Y,V) —> Mor (4, V) 
is defined by 
(fig) > h=gof, 


where f € Mor(4’,)), g € Mor(),V), h € Mor(V,V); the composition of 
morphisms is assumed to satisfy the following conditions, 


lfm the sense of the axiomatization of the set theory proposed by Godel (see, Semadeni and 
Wiweger 1978, pp. 21-22). 
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(a) associativity: j o(go f) =(j0g)of, where j € Mor(V,U), and f and g are as 
above; 

(b) the existence of morphism identity: for every object ¥ € A° there exists one 
and only one morphism Mor (4, 4%), denoted by Idx, such that for every object 
Y € A® one has 


fold, =f, for every f € Mor(¥,y), 
Id,og=g, for every g € Mor(), +). 


1.2. Example. A category the objects of which are any sets and morphisms — 
any mappings is called Ens. 

1.3. Example. A category the objects of which are any topological spaces, and 
morphisms — continuous mappings is called Top. 

1.4. Example. A category the objects of which are arbitrary, but finitely di- 
mensional C* differential manifolds and morphisms — arbitrary C* diffeomorphisms 
is called C*Diff.? 

1.5.-1.7. Examples. Analogously, one speaks about categories Vect, Gr, and 
Ab, the objects of which are vector spaces, groups, Abelian groups, and morphisms 
~— any linear mappings and any homomorphisms, correspondingly. 

1.8. Example. Let us recall that an affine space is a triple A = (€, V, +), where 
E is any set, V a vector space, and + a mapping +:€ x V — € such that V acts as a 
transitive and free group of transformations of € (for more details consult Schwartz, 
vol. I 1979, pp. 185-199). Let A; = (€&,¥i,+) and Az = (&2, V2, +) be affine spaces. 
Let us define a mapping f:€ — & such that there exists a corresponding mapping 
Tf: Vi — V2 satisfying the condition 


f(u+p)=rf(v) + f(p), 


for any p € €; and v € V,. The mapping f is said to be an affine morphism. The 
class of affine spaces together with affine morphisms as morphisms forms a category 


called Aff. 


1.9. Example. A category, the objects of which are any metric spaces and 
morphisms — any contractions, i.e., mappings a: ¥ — Y such that 


py (a(x1), a(22)) < px(21, 22), 


where px and py are metrics on spaces X and ) correspondingly, 11, 22 € 1, is called 


Metr. 


1.10. Example. A category the objects of which are fibre bundles and mor- 
phisms — bundle morphisms (see chapter 3, sec. 2), is called Bun. 

1.11. Definition. A category B is a subcategory of a category A if the objects 
and morphisms of B are also objects and morphisms of A, and if the compositions 


?In the case of C© Diff, we shall simply write Diff. 
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of morphisms in B are restrictions of the corresponding compositions of morphisms 
in A. 

For instance, Ab is a subcategory of Gr. 

1.12. Definition. A morphism f € Mor (4’, y) is said to be an isomorphism if 
there exists another morphism g € Mor (), 4’) such that fog = Idy and gof = Idx. 

For instance, bijections are isomorphisms in Top, isometries — in Metr, group 
isomorphisms — in Gr and Ab. Having a category A, one can define a category ¢A, 
the objects of which are those of A, and morphisms — the isomorphisms of A. 

In the category theory, morphisms play, in a sense, a more important role than 
objects. One can construct a category axiomatically in such a way that its primitive 
concepts, besides logical constants and the equality sign, are (1) “a is a morphism”, 
and (2) “composition of morphisms ao is defined and equal y”. The axioms guaran- 
tee the associativity of compositions and the existence of identity morphisms. Since 
the concept of objects does not appear in the axioms, one speaks of an objectless 
category theory. It can be shown that a category defined at the beginning of the 
present section is a model of the axiomatic objectless category theory (see Semadeni 
and Wiweger 1978, pp. 44-46). 

An important concept in the category theory is that of functor. 

1.13a. Definition. Let A and B be two categories, A° and B° — the classes of 
their objects, and Mor (A) and Mor (B) — the classes of their morphisms. A covariant 
functor from the category A to the category B is a mapping 


C: (A, A°, Mor(A)) —> (B, B°,Mor(B)), 


satisfying the following conditions: 
1) if f € Mor(4#,) € Mor(A), 7,Y € A®, then C(f) € Mor(C(4),C()) € 
Mor (B), C(¥),C(¥) € B®; 
2) if f,g € Mor(A) and f can be composed with g, then C(g) oC(f) =C(go f) € 
Mor (B); 
3) C(Idx) = Idorx), XE vA? C(*) <€ len Idx € Mor (A), Idecx) € Mor (B). 


The following diagram elucidates the structure of the above definition: 
ee ye 
Be cia" e0) “4 c(zZ) 


The symbol C(f) is often replaced by the symbol f,. To simplify the notation 
we shall also write C: A > B. 

1.13b. Definition. A contravariant functor from the category A to the cate- 
gory B is obtained by introducing to definition 1.13a modifications visualised in the 
following diagram: 


a yee 


B: COX eee) 22ae(Z) 
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The symbol C(f) is often replaced by the symbol f*. 
1.14. Example. A mapping 


C:. (CDi, (X",2),f) —> (C'S) Diff, Tx(X),f(z)), 


where X” is any n-dimensional C* manifold, Tx(X") — the tangent space to X” at 
a point r € X”, and f’ — the derivative mapping of the mapping f transforming a 
C* manifold into a C* manifold, is a covariant functor from C*Diff to C*—! Diff. 

The category theory has enormous ability to order mathematical structures and 
make precise concepts which has so far been used intuitively. The notion of naturality 
belongs to such concepts. Eilenberg and MacLane considered the formalization of 
this notion as a main motive of creating the category theory (this can be seen in 
the title of their pioneering work; see Eilenberg and MacLane 1945). Even if further 
developments of the theory transcended this goal, it remains true for the present 
Appendix. Strict definition of the naturality concept is essential to grasp logical 
dependencies between different space-time structures. 

1.15. Definition. Let C; and C, be two functors of the same variance (i.e., 
either both covariant or both contravariant) from a category A to a category B: 


Ci: A— B, 

Ca: A — B. 
A natural mapping (called also a morphism functor) of C, into C2 is a mapping 

N: A° —+ Mor(B), 
ascribing to every object A € A° a morphism of the category B: 
A +> N(A) € Mor(B), 

in such a way that 
It is also assumed that, for every f: A; — A2, f € Mor(A), Ay, Az € A®, the 


following diagram is commutative (it is assumed that all the required compositions 
are meaningful) 


G 
C,(A,) a C(A2) 
NA) N(A2) 
Cf) 


C,(A,) —————_—_» C,,(A,) 


(all in this diagram occurs in the category B). If, additionally, for every A € A®° 
the morphism (A) is isomorphism in B, then the functors C, and C, are said to be 
naturally equivalent. 
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1.16. Example. Let us consider a contravariant functor *: Vect > Vect, defined 
by f*(9) =90f for any mapping f: A > B, g:B > R (f* is called a pull back of the 
function g by f; see chapt. 1, sec. 4). Analogously, we define f** (a pull back of a 
pull back), and a covariant functor **: Vect — Vect. Now, let A = B = Vect, C, = Id, 
C, = **. The mapping V(V):V > V**, V € Vect® is a natural transformation as it 
can be easily seen from the commutative diagram, valid for any f: V > U,U € Vect® 


aon OVE! cay a 
NV) na) 
cw)-v LDF o@meus 


If we limit our considerations to the category of finitely-dimensional vector 
spaces, the functors C; = Id and C, = ™ are naturally equivalent (see Trautman 
1973b, p. 185). 

Definitions of natural transformation and natural equivalence can be “naturally” 
extended to the case of functors of many variables (see Semadeni and Wiweger 1978 


(pp. 134-136). 


2. Aristotelian Space-Time 


Dynamical theories investigate motions of bodies under the influence of forces. The 
first goal of a dynamical theory is to provide a criterion allowing one to determine 
whether any force has appeared. The role of such a criterion is usually played by the 
definition of “free motions”, i.e., motions which are going on without any intervention 
of forces: A deviation from a free motion testifies to the appearance of a force. The 
formulation of such a criterion is the content of the first law of dynamics. If one 
assumes that motions under no influence of forces are determined by the structure of 
space-time — and this assumption seems to be very natural — then the first law of 
dynamics contains in itself information about this structure. Geometry, interpreted 
in such a way, becomes a part of the given dynamical theory, and, as it belongs to 
physics, is subject to empirical verification. 
In Aristotle’s dynamics? the first law can be formulated as follows: 


There exists the natural rest, 


or, equivalently, 


3It is less known that Aristotle formulated a complete dynamics, and attempted at doing it in a 
quantitative manner (see Aristotle’s Physics, especially the final parts of book VII). From our present 
point of view the Aristotelian “physics” is a mixture of philosophical views, purely verbal analyses, 
empirical statements (often erroneous ones), and a natural knowledge based on them. One could 
distillate from this a part which would correspond to what is today called physics. It goes without 
saying that the basic principles of the Aristotelian physics have been experimentally falsified. 
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a body, not acted upon by any force, remains in the absolute rest.4 


This law implies the structure of space-time in the following way. The absolute 
rest allows one to meaningfully speak about the same place at different times (a body 
in the state of absolute rest always occupies the same place), and consequently it 
determines a projection ty from the four-dimensional space-time A into a three— 
dimensional manifold Y. The existence of the Aristotelian absolute time makes it 
possible to meaningfully speak of the absolute simultaneity of events (even if they are 
distant from each other), and consequently it defines a projection mr of the manifold 
A into the “axis of absolute time” T. The existence of the projections ty and m7 is 
equivalent to the existence of the absolute space and the absolute time, respectively. It 
follows that the Aristotelian space-time A is a product bundle (A = 3 xT, Y, wz).° 
One could easily show that this structure is natural for the Aristotelian space-time 
or, more strictly, that between suitably defined category of Aristotelian space-times 
and the category of product bundles there exists a natural transformation. Proof of 
this fact is similar to the proof which will be carried out in detail for space-time of 
classical mechanics. 

Ascribing to the sets 1’, T and A the structure of smooth manifolds is a styl- 
ization of the history, but a stylization not too far exaggerated, if one takes into 
account the fact that the contemporary differential manifold notion directly comes 
from the Aristotelian concept of continuum for which ideas of space and time in turn 
constituted its prototypes. 

The structure of the Aristotelian space-time is invariant with respect to the 
direct product of dilatation, translation and rotation groups (acting on 2’) by an 
affine group (acting on T). 

The appearance of absolute velocity with respect to the absolute space testifies 
to the intervention of a force. In Aristotle’s views, this force is proportional to the 
velocity produced by it. This statement can be thought of as to form the second law 
of the Aristotelian dynamics. However, we shall skip this problem as marginal for 
our discussion.® 


3. Space-Time of Classical Mechanics Without Gravitational Field 


Reconstruction of the space-time structure presupposed by classical mechanics is 
especially instructive. It not only shows that the structures, discovered later on by 


“One should remember that Aristotle assumed the existence of the so-called natural motions 
(e.g. motions of heavy bodies towards the centre of the Earth). Of course, we could include such 
motions into our reconstruction of the Aristotelian dynamics, but this would highly complicate the 
picture without throwing any new light on questions we are interested in. 

®One can also say that the Aristotelian space-time A is the Cartesian product)A =p. 

5More precise formulation of the second law of the Aristotelian dynamics would require a far- 
reaching historical stylization. It would mainly consist in translating some terms, used by Aristotle 
in their every -day meanings, into their counterparts elaborated much later in order to correctly 
express laws of dynamics. For instance, the Aristotelian term “efficient factor” should be replaced 
by “force”, “heaviness” by “mass”, etc. (See also Toulmin and Goodfield, 1961, pp. 93-100.) 
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relativity theory, existed and were well functioning from the very beginning of modern 
physics, but also demonstrates how quickly a physical theory gains independence from 
the philosophical views of its inventor and begins to lead an autonomous life on its 
own. Newton ascribed to space and time, which appeared in mechanics created by 
himself, Aristotelian properties, that is to say he treated them as absolute ones. It 
turns out that this remains correct as far as time is concerned, but is false with regard 
to space. One should distinguish the structure of space-time as postulated by Newton 
himself and the structure of space-time as presupposed by classical mechanics. The 
former is often called Newton’s space-time (see Ehlers 1973), the latter Galileo space- 
time (see Kopczynski and Trautman 1981, Trautman 1970, 1973b). In the following 
we shall speak simply of space-time of classical mechanics. This seems to be justified 
since there is a necessity to distinguish the space-time structures of classical mechanics 
without gravitational field and with gravitational field. Now, we are going to analyse 
the first of these two cases, postponing the second one to the next section. 


The first law of the Newtonian dynamics establishes a democracy among all 
inertial reference frame: none of such frames is privileged in any way. The standard 
of the absolute rest disappears from the theory. The Aristotelian projection my ceases 
to be a natural structure. Every observer, connected with a reference frame, has his 
private standard of rest (“with respect to his own inertial frame”); that is to say, 
his ewn projection corresponds to the Aristotelian projection ty. However, absolute 
simultaneity and absolute acceleration remain valid in the classical world picture. 
Absolute simultaneity defines “spaces of equal time”, and consequently leaves the 
Aristotelian projection ty unchanged. Absolute acceleration connects the velocities 
at different points of space-time with one another: vanishing of acceleration allows 
one to meaningfully speak of “the same velocity at space-time points separated from 
one another by non-zero distances” (but not of the zero absolute velocity in different 
inertial frames). This leads to the so-called teleparallelism, i.e., to the concept of 
parallel vectors at points separate from one another in space-time.’ Therefore, one 
must assume that space-time of classical mechanics (without gravitational field) is 
a four-dimensional differential manifold equipped with an affine connection which 
makes meaningful parallel transport of vectors (teleparallelism). It is natural to 
consider this connection in terms of a principal fibre bundle over space-time. Since 
any local inertial frame of reference can be identified with a basis at a given space— 
time point, the principal fibre bundle in question should be concretized to the bundle 
of frames over space-time. Let us consider this structure in more detail. 


An affine space A = (E€, V, +) (see example 1.8) equipped with a bilinear map- 


7Let us turn our attention to the fact that the absolute character of acceleration in classical 
mechanics does not imply the existence of an absolute space. From the historical point of view, 
the appearance of “absolute accelerations” constituted the main Newton’s argument on behalf of 
the existence of absolute space. This argument was the focus of heated discussions around the 
conceptual structure of classical mechanics (see Raine 1981a, p. 1157). 
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ping h:V* x ¥* — R!, such that A is a symmetric positively defined mapping of 
rank n — 1, where n is the dimension of the vector space V, will be called space-time 
of classical mechanics without gravitational field and denoted by N. We shall write 

Let M, = (E1, Vi, +; Ai) and N2 = (2, V2, +; he) be two such space-times. An 
affine morphism f will be called Galileo’s morphism if hz = hy o (tf)*, where the 
asterisk denotes the dual functor. The class of space-times of classical mechanics 
without gravitation with Galileo morphisms as morphisms is a category which will 
be denoted by Gal; it can be thought of as a subcategory of the category Diff. A 
Galileo’s morphism, which is an automorphism, is called a Galileo’s transformation. 

Let S C Y* be defined as a subspace for which h = 0, and SL as a subspace 
of vectors orthogonal to S. Let mr:€ — €/S =T be a canonical projection. Now, 
N can be presented as a fibre bundle (€, T, wr, S), where S is a typical fibre. A 
functor o:Gal — Bun can be defined from the category of spacetimes of classical 
mechanics without gravity to the category of bundles 


a(N) = (E, fr; IT, S), 

o(f) =(f,4), 
where f:&, — & is a Galileo’s morphism and a:T,; — T> a uniquely determined 
mapping satisfying the condition: [7,0 f = aompz,. 

Now, let us define another functor w: Gal —> Bun to every space-time NV € Gal® 
ascribing a product fibre (T x S, T, rr). It turns out that there is no natural trans- 
formation between functors o and w. This statement is mathematical counterpart 
of the intuitive statement that space-time of classical mechanics (without gravity) is 
not a Cartesian product of absolute time and absolute space. 

The existence of teleparallelism in space-time determines horizontal cross— 
sections of the frame bundle over space-time. Indeed, all frames, resulting from 
a parallel transport of an arbitrarily chosen frame at a certain point of space-time, 
smoothly select a single point of space-time in each fibre of the frame bundle. Intu- 
ition suggests that in this way the natural product structure, lost in space-time, can 
be recovered in the fibre bundle of frames over this space-time. We will see that this 
is indeed the case. 


Let us define the following two functors: the functor 
B: gAff — PBun 
from the category of affine spaces to the category of principal fibre bundles, associating 


with every object A = (€,V, +) € Aff? a frame bundle B(E€) € PBun, and the 


functor 


C: gAff —> PBun, 
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associating with every A = (€, V, +) a product bundle C(.A) = Ex B(V). These func- 
tors are naturally equivalent as it can be easily seen from the following commutative 
diagram for every f:€ > €2: 


B(E,) ES B(E2) 
N(E,) N(E2) 
cif) 


C(E,) = £, x B(V;) ———— C(E£,) = E, x B(Y) 


where C(f) = f x B(rf), and N is a natural transformation of the functor B into the 
functor C. Therefore, in classical mechanics without gravity the product structure 
is naturally transferred from space-time to the frame bundle over space-time (see 
Trautman 1973b). 


4. Space-Time of Classical Mechanics with Gravitational Field 


At the basis of the Newtonian theory of gravity lies the statement which deserves the 
name Galilean principle of equivalence; it asserts that in a gravitational field all bodies 
fall with the same acceleration. As is well-known, this principle allows one to treat 
freely falling reference frames as locally non-accelerating, that is to say, reference 
frames in which laws of the Newtonian dynamics are locally valid (see Raine and 
Heller 1981, p. 83-84). However, the presence of a gravitational field causes focusing 
(or antifocusing) of space-time trajectories of freely falling bodies. For instance, the 
trajectories of bodies freely falling on Earth should meet at the point corresponding 
to the centre of the Earth. This effect is connected with the appearance of tidal forces. 
These forces liquidate the existence of teleparallelism: now, parallel transport depends 
on the path along which it is performed. We express this by saying that the affine 
connection over space-time of classical mechanics with gravity is non-integrable. This 
means that in the frame bundle over space-time there are no global cross-sections 
(in the case without gravity such cross-sections were implied by teleparallelism which 
now does not exist), and consequently the connection is not~flat (see chapt. 3, sec. 5). 
In other words: the presence of gravity manifests itself in the existence of a non- 
vanishing curvature form on the frame bundle over space-time. 

The non-existence of global cross-sections in the frame bundle over space time 
suggests that this bundle, in the presence of a gravitational field, cannot be expressed 
in the form of a Cartesian product (globally). Indeed, in this case, there are no 
naturally equivalent functors associating with the space-time a trivial frame bundle. 


(See Raine 1981a, Raine and Heller 1981, chapter 6.) 
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5. The Special and General Theories of Relativity 


Further logic of the evolution of space-time strucures is almost automatic. Space- 
time of special relativity (see chapter 2, sec. 3) has no product structure. Moreover, 
it has no natural fibration into global spaces of equal time (i.e., it has no projection 
mq which would be independent of the frame of reference) that existed in space-time 
of classical mechanics. 

Space-time of special relativity — in contradistinction to spacetime of classical 
mechanics — carries a Lorentz structure and its affine connection is consistent with 
this structure. It is a flat connection which makes it possible to express the bundle 
of (orthonormal) frames in the form of a Cartesian product. 

In general relativity (see chapter 4) the frame bundle over space-time has a non-— 
vanishing curvature form which implies that this bundle is not, in any natural way, 
a trivial one. The essential difference between the space-time structure of classical 
mechanics with gravitational field and that of general relativity consists in that the 
latter is equipped with a Lorentz metric. In the architecture of the frame bundle this 
difference manifests itself in the existence of the soldering form which does not exist 
in classical mechanics. 

The existence of a linear connection in the frame bundle B(M) over space-time 
M is responsible for the fact that the frame bundle B(B(M)) over the bundle B(M) 
is — in a natural way — a Cartesian product. This fact might suggest that the next 
step in the evolution of space-time theories could consist in depriving the bundle 
B(B(M)) of its product structure and in the physical interpretation of this procedure 
(see Trautman 1970a, 1973b). 

It is worth noticing that, when discussing the naturality problem in the case of 
general relativity, one should take into account the category of principal fibre bundles 
with linear connections. A connection morphism is a morphism of principal bundles 
such that the image under this morphism of any horizontal curve is a horizontal curve 
(see Sulanke and Wintgen 1977, p. 133). 

The diagram on the adjacent page shows the logic of evolution of space-time 
structures (it is a modified version of the diagram appearing in the two works of 
Trautman quoted above). 


6. Comments and Remarks 


A. The logic of evolution of space-time theories, shown in the present Appendix, is 
a consequence of the strong stylization of the history of physics. It was the recon- 
struction of space-time structures of various theories in terms of modern differen- 
tial geometry that has allowed us to see the logical connections between subsequent 
stages of the evolutionary process. The view that such a reconstruction reflects, at 
least some, real features of development of science seems well founded. On the other 
hand, however, there can be no doubt that the true history of science was much more 
complicated and much less logical: it was like a tree with the main stem and many 
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side-branches rather than a logical chain the parts of which smoothly change from 
one into another. ; 

For example, the ideas of Berkeley and Leibniz, later on developed by Mach, 
constituted an important side~branch which could be thought of as a concurrence 
with respect to the Newtonian main stem (see Raine and Heller 1981, chapter 4; Bar- 
bour 1983). The essence of these ideas is contained in the postulate that the entire 
structure of space-time, and consequently the whole of dynamics, should be reduced 
to relationships between bodies without assuming any “absolute” manifold (see chap- 
ter 1, sec. 6, B). An attempt to construct such a dynamics has been undertaken by 
Barbour and Bertotti (1977). These authors believe that laws of the new dynam- 
ics should be invariant with respect to the following transformation group called the 
Leibniz group, 


ri Air +9(2), 
tr— fit), 


where A(t) is an orthogonal matrix, and f and g are any smooth functions of time 
(additionally, one assumes that f is a monotonic function). The existence of the 
“universal time” ¢ (the existence of the projection m7) is justified by the Leibniz 
postulate of the “coexistence of events”. The Leibniz group not only admits the 
rescaling of time, consisting in choosing new units and a new beginning of time 
counting (as did the Galileo group in classical mechanics), but also more general time 
transformations. The Leibniz group does not distinguish uniform motions; it also 
admits any suitably smooth rotations of instantaneous spaces. The Galileo group is 
a subgroup of the Leibniz group. (See also Liebscher and Yourgrau 1979). 


B. In the philosophy of science there is a heated polemics in which the following 
views fight one another (a) the evolution of science goes on more or less logically, 
and the “time arrow” of this evolution is determined by the continuous accumulation 
of results (Popper, Lakatos); (b) strictly speaking, there is no evolution, but periods 
of “normal development” are separated from each other by all—changing revolutions; 
“normal periods” from before and after a revolution are “incommensurable” (Kuhn); 
(c) one should stop speaking about the progress in science since science is not ruled 
by logic but by laws of sociology and psychology of scientists (Wilson) or by the 
sole rule “anything goes” (Feuerabend). The polemics is often referred to in the 
literature as dispute on the rationality of science or of its evolution. The ideas briefly 
signaled in (c), which try to reduce all rationality criteria to social conditions changing 
from epoch to epoch, have recently grown in number and popularity (for a review 
of this dispute see for instance, Zyciriski 1988). The analyses, carried out in the 
present Appendix, throw some light on this problem. In spite of the strong historical 
stylization, which was indispensable in order to translate older space-time theories 
into the language of the present scientific paradigm, one cannot ignore the very fact 
that earlier theories can be translated into the modern language, and that in this 
process surprising logical dependencies are revealed within both the structures of 
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particular theories and the structure of the entire evolutionary chain. Of course, it 
is true that in this way our present criteria have been adapted to the former theories 
in such a manner that the whole procedure could succeed. Nevertheless, the result of 
this analysis should be considered as philosophically significant. 


Bibliographical Note. Elements of the mathematical category theory could be learned, for 
instance, from Dodson (1980) or Semadeni and Wiweger (1972, 1978). Applications of this theory to 
differential geometry can be found in Sulanke and Wintgen (1972), and to the analysis of space-time 
structures in the works by Trautman (1970a,b, 1973b). Mathematical aspects of the classification of 
space-times (in terms of bundles which can be associated with the frame bundle over space-time) 
are presented in Trautman’s paper (1976). 

Reconstruction of space-time theories in the language of modern differential geometry has been 
carried out in the following works: Penrose 1968; Ehlers 1973; Trautman 1968, 1970a, 1971, 1973b; 
Kopezynski and Trautman 1981, Thirring 1978 (chapter 6); Raine and Heller 1981; Toretti 1983; 
Friedman 1983. Toretti’s book contains some historical comments; Friedman focuses on philosophical 
aspects. One should take into account the different terminological conventions adopted by various 
authors, which sometimes could give the misleading impression that results of the analyses lead to 
contradictory results. 
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